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CHAPTER  1 


INTRODUCTION 


Most  complex  systems,  be  they  technological  or  social,  are  naturally  modeled  as 
networks,  or,  from  a  mathematical  point  of  view,  graphs,  where  nodes  correspond  to 
discrete  entities  or  components  of  a  system  and  edges  represent  dyadic  relations  be¬ 
tween  components.  Prominent  examples  are  online  social  networks  such  as  Facebook 
and  Google+,  which  make  visible  social  ties  between  individuals.  Another  example 
is  furnished  by  the  online  encyclopedia,  Wikipedia,  which  is  a  substantiation  of  the 
structure  of  human  knowledge.  The  emergence  of  these  networks  has  triggered  an 
enormous  amount  of  interest  in  the  scientific  community,  and  considerable  work  has 
been  devoted  to  their  study. 

One  line  of  study  focuses  on  the  structural  properties  of  networks  themselves.  Ex¬ 
tensive  empirical  study  has  been  conducted  for  a  wide  range  of  social  and  technologi¬ 
cal  networks,  focusing  on  various  network  characteristics,  such  as  degree  distribution, 
reciprocity  and  clustering  coefficient.  Empirical  study  has  led  to  many  interesting  dis¬ 
coveries,  such  as  the  ubiquitous  power-law  degree  distributions  and  the  small  world 
property.  On  the  theoretical  side,  a  variety  of  models,  either  static  or  dynamic,  have 
been  proposed  to  explain  the  empirical  observations  such  as  the  power-law  degree 
distributions  and  the  small  world  property. 

Another  line  of  study,  in  contrast,  focuses  on  dynamic  processes  on  networks.  In 
this  case,  networks  often  serve  as  platforms  for  the  ongoing  processes,  which  usually 
involve  information  dissemination.  The  focus  here  is  on  understanding  the  interaction 
of  network  structural  characteristics  and  dynamic  processes.  For  instance,  network 
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structures  can  affect,  explicitly  or  implicitly,  the  efficiency  of  information  dissemina¬ 
tion,  which  may  in  turn  change  the  network  structures,  as  exemplified  by  the  constant 
follow  and  unfollow  actions  on  the  Twitter  network. 

This  thesis  studies  various  aspects  of  networks  along  the  two  lines  mentioned 
above.  The  goal  is  to  advance  our  understanding  of  structures  and  behaviors  of  large 
scale  complex  networks. 

In  the  first  part  of  the  thesis,  we  focus  on  the  interpretation  of  observed  network 
characteristics.  The  availability  of  vast  amounts  of  large  scale  network  data  has  made 
it  possible  to  study  network  characteristics  and  system  behaviors  as  never  before,  so 
we  constantly  face  the  problem  of  properly  interpreting  empirical  observations.  Differ¬ 
ent  network  properties  are  commonly  studied  and  interpreted  separately.  However, 
they  are  generally  interdependent.  Specifying  one  usually  imposes  nontrivial  con¬ 
straints  on  another.  Taking  such  constraints  into  account  provides  an  additional,  and 
perhaps  more  appropriate,  way  of  interpreting  empirical  observations.  As  a  demon¬ 
stration  of  the  usefulness  of  this  approach,  we  study  the  maximum  reciprocity,  i.e.  the 
maximum  percentage  of  edges  with  a  reciprocal  edge,  that  is  realizable  by  a  network 
with  prescribed  degree  sequence. 

In  the  second  part  of  this  thesis,  we  study  network  growth  dynamics  under  cu¬ 
mulative  advantage,  which  refers  to  the  “rich-get-richer”  phenomenon.  An  example 
of  such  growth  dynamics  is  the  Bianconi-Barabasi  model  [12].  The  growth  process 
can  be  viewed  as  a  competition  for  links,  which,  directly  or  indirectly,  represent  some 
kind  of  resource.  The  evolution  over  time  of  the  competition,  and  in  particular  the 
change  in  leadership,  is  a  very  intricate  process  that  depends  on  the  interplay  of  the 
cumulative  advantage  effect,  individual  competitiveness  or  fitness,  and  randomness. 
We  focus  on  the  relative  leadership  between  two  nodes  and  characterize  the  reduced 
competition  dynamics  in  terms  of  its  duration  and  intensity. 
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In  the  third  part  of  this  thesis,  we  study  information  dissemination  in  social  net¬ 
works.  The  efficiency  of  information  dissemination  as  characterized  by  propagation 
delay  is  affected  by  both  network  topology  and  user  behavior.  We  analyze  the  effi¬ 
ciency  for  both  cooperative  and  selfish  user  behaviors  in  various  network  topologies, 
and  explore  the  design  of  incentive  mechanisms  when  cooperation  is  critical  to  efficient 
information  dissemination. 

1.1  Thesis  Contributions 

This  thesis  makes  the  following  main  contributions. 

•  We  formulate  the  maximum  reciprocity  problem  that  seeks  the  maximum  reci¬ 
procity  realizable  by  a  network  with  given  degree  constraints,  and  prove  its 
NP-hardness.  We  provide  an  upper  bound  together  with  necessary  conditions 
and  sufficient  conditions  for  achieving  the  bound.  We  ford  that  this  bound  is 
surprisingly  close  to  the  empirical  reciprocity  in  a  wide  range  of  real  networks, 
and  that  there  is  a  surprisingly  strong  linear  relationship  between  the  two.  We 
partially  characterize  networks  with  maximum  reciprocity  by  identifying  some 
suboptimal  motifs.  We  demonstrate  that  a  particular  type  of  small  suboptimal 
motifs  called  3-paths  are  the  major  cause  for  suboptimality  in  real  networks. 

•  We  characterize  the  tail  distributions  of  duration  and  intensity  for  pairwise  com¬ 
petition  under  cumulative  advantage.  When  the  two  competitors  are  equally 
competitive  or  fit,  we  obtain  the  exact  asymptotic  distributions.  When  they  are 
not  equally  fit,  we  obtain  asymptotic  bounds  on  the  distributions.  We  demon¬ 
strate  that  duration  always  has  a  power-law  tail  irrespective  of  competitors’ 
fitness,  while  intensity  has  either  a  power-law  tail  or  an  exponential  tail  de¬ 
pending  on  whether  the  competitors  are  equally  fit.  We  observe  the  struggle-of- 
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the-fitness  phenomenon,  where  a  slight  different  in  fitness  results  an  extremely 
heavy  tail  of  duration  distribution. 

•  We  characterize  for  various  network  topologies  the  information  propagation  de¬ 
lay  for  both  cooperative  and  selfish  user  behaviors  and  the  corresponding  price 
of  stability.  We  identify  topologies  where  cooperation  plays  a  critical  role  in 
efficient  information  propagation.  We  propose  an  incentive  mechanism  called 
“plus-one”  to  coax  users  into  cooperation  in  such  cases,  and  demonstrate  its 
effectiveness  through  simulation. 

The  rest  of  this  thesis  is  organized  as  follows.  Chapter  2  presents  our  investigation 
on  reciprocity  in  networks  with  degree  constraints.  Chapter  3  analyzes  competition 
dynamics  under  cumulative  advantage.  Chapter  4  explores  the  efficiency  of  informa¬ 
tion  dissemination  in  social  networks  with  limited  budget  of  attention.  We  conclude 
in  Chapter  5  and  discuss  some  future  directions. 
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CHAPTER  2 


RECIPROCITY  IN  NETWORKS  WITH  DEGREE 

CONSTRAINTS 


2.1  Introduction 

Many  complex  networks  are  naturally  directed,  which  endows  them  with  nontrivial 
structural  properties  not  shared  by  undirected  networks.  One  such  property  that 
has  been  widely  studied  is  reciprocity,  which  is  classically  defined  as  the  fraction 
of  edges  that  are  reciprocated,  i.e.  paired  with  an  edge  of  the  opposite  direction. 
Nontrivial  patterns  of  reciprocity  can  reveal  possible  mechanisms  of  social,  biological 
or  other  nature  that  systematically  act  as  organizing  principles  shaping  the  observed 
network  topology  [31].  Previous  work  shows  that  reciprocity  plays  an  important  role 
in  many  information  networks  such  as  email  networks  [57],  the  World  Wide  Web  [3] 
and  Wikipedia  [81,  82].  It  is  also  shown  that  major  online  social  networks  that  are 
directed  in  nature,  such  as  Twitter[44,  51],  Google+[53],  Flickr  [56,  18],  LiveJournal 
[78,  56,  32],  and  YouTube  [56],  all  exhibit  a  nontrivial  amount  of  reciprocity. 

When  we  try  to  interpret  observed  values  of  reciprocity,  we  are  faced  with  the 
problem  of  assessing  the  significance  of  the  observation.  For  instance,  the  Swedish 
Wikipedia  has  reciprocity  of  21%.  How  significant  is  this?  This  question  is  often 
answered  by  comparing  measured  values  with  the  expected  value  of  some  null  model. 
One  commonly  used  null  model  is  a  random  graph  with  the  same  number  of  nodes 
and  edges  [57].  An  alternative  is  a  random  graph  with  specified  degree  sequence,  as 
the  specific  degree  sequence  is  expected  to  affect  reciprocity  [79].  Networks  are  then 
classified  as  reciprocal  or  anti-reciprocal  according  to  whether  the  observed  reciprocity 
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is  larger  or  smaller  than  the  expected  value  [31].  Significant  deviation  from  the 
expected  values  suggests  the  existence  of  some  underlying  organizational  mechanism 
at  work.  For  our  example  of  Swedish  Wikipedia,  the  expected  reciprocities  in  both 
random  null  models  are  almost  zero,  so  the  Swedish  Wikipedia  is  classified  as  a 
reciprocal  network.  Informative  as  this  might  be,  comparison  with  expected  values 
is  not  the  whole  story.  Is  21%  a  significant  deviation  from  0?  Can  we  say  that 
the  tendency  to  reciprocate  is  strong  in  this  network?  The  answer  might  depend 
on  the  eye  of  the  beholder.  However,  if  we  know  for  some  reason  the  maximum 
possible  reciprocity  is  only  28%,  then  we  may  safely  conclude  that  21%  is  indeed  a 
significant  amount  of  reciprocity.  On  the  other  hand,  if  the  maximum  is  90%,  we 
might  conclude  that  21%  is  not  as  significant  as  suggested  by  the  comparison  with 
random  null  models.  In  general,  knowledge  of  the  extremal  values  can  give  a  better 
idea  about  where  the  observation  lies  in  the  entire  spectrum,  which  can  potentially 
change  our  conclusion  about  the  significance  level  of  the  observation. 

Since  real  social  networks  often  exhibit  reciprocities  larger  than  those  associated 
with  the  random  null  models,  we  concern  ourselves  only  with  the  maximum  achiev¬ 
able  reciprocity  in  this  chapter.  As  in  the  random  null  models,  we  may  want  to  retain 
certain  key  structural  features  of  the  real  network  when  we  maximize  reciprocity. 
The  particular  feature  that  we  choose  to  preserve  here  is  the  joint  in-  and  out-degree 
sequence,  which  is  a  confounding  factor  in  the  study  of  reciprocity  [79].  In  real  net¬ 
works,  in-  and  out-degrees  often  serve  as  proxies  for  some  kind  of  capacities  of  the 
corresponding  node.  For  example,  in  a  file  sharing  network  where  edges  represent 
transfers  from  file  sources  to  downloaders,  the  in-degree  of  a  node  can  reflect  the 
available  network  bandwidth  and  the  out-degree  the  amount  of  resource.  In  a  social 
network  where  edges  point  from  followers  to  followees,  the  in-degree  of  a  node  can 
reflect  its  fame  and  popularity  and  the  out-degree  its  budget  of  attention.  Quite  often 
these  capacity  constraints  are  too  important  to  be  ignored  in  the  network  under  con- 
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sideration.  By  preserving  the  degree  sequence,  we  honor  these  capacity  constraints, 
thus  controlling  these  confounding  factors. 

Motivated  by  the  above  considerations,  we  study  the  problem  of  maximizing  reci¬ 
procity  subject  to  prescribed  joint  in-  and  out-degree  constraints.  We  make  the 
following  contributions  in  this  chapter. 

•  We  formulate  the  maximum  reciprocity  problem.  We  provide  a  simple  upper 
bound  on  reciprocity  and  prove  that  it  is  NP-complete  to  decide  the  achievability 
of  the  bound.  We  also  identify  sufficient  conditions  for  achieving  the  bound. 

•  We  demonstrate  that  the  upper  bound  is  surprisingly  close  to  the  empirical 
reciprocity  in  a  wide  range  of  real  networks,  which  suggests  that  the  tendency 
to  form  reciprocal  edges  might  be  much  stronger  than  the  observed  reciprocity 
indicates. 

•  We  identify  some  suboptimal  network  motifs  that  can  be  eliminated  to  increase 
reciprocity,  thus  providing  a  partial  characterization  of  networks  with  maximum 
reciprocity.  Based  on  a  particular  type  of  small  suboptimal  motif  called  3- 
paths,  we  provide  a  greedy  algorithm  GreedyRewire  to  maximize  reciprocity. 
We  demonstrate  that  3-paths  are  the  major  cause  for  suboptimality  in  real 
networks. 

•  We  find  a  surprisingly  strong  linear  relationship  between  the  empirical  reci¬ 
procity  and  the  upper  bound  across  a  wide  range  of  real  networks.  We  also  find 
a  similar  linear  relationship  between  the  number  of  reciprocated  edges  and  the 
corresponding  upper  bound  on  the  logarithmic  scale. 

The  rest  of  the  chapter  is  organized  as  follows.  Section  2.2  introduces  the  max¬ 
imum  reciprocity  problem.  Section  2.3  proves  the  NP-hardness  of  the  problem,  and 
provides  a  simple  upper  bound  for  the  maximum  reciprocity.  Section  2.4  identi¬ 
fies  patterns  of  maximum  digraphs  and  provides  a  greedy  algorithm  for  eliminating 
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suboptimal  motifs.  Section  2.5  conducts  some  empirical  study  of  real  networks  and 
Section  2.6  concludes  this  chapter. 


2.2  Graphic  Sequences  and  Maximum  Reciprocity  Problem 

In  this  section,  we  first  introduce  the  notion  of  a  graphic  sequence  for  undirected 
graphs  and  then  a  graphic  bi-sequence  for  directed  graphs  or  digraphs  for  short,  which 
will  be  used  in  the  theoretical  analysis  of  Section  2.3.  We  then  formulate  the  maximum 
reciprocity  problem.  Throughout  the  rest  of  the  chapter,  a  graph,  directed  or  not, 
always  means  a  simple  graph,  i.e.  no  self-loops  or  multiple  edges  are  allowed.  We  will 
use  the  terms  node  and  vertex  interchangeably.  For  directed  graphs,  an  edge  always 
means  a  directed  edge. 

2.2.1  Graphic  Degree  and  Bi-degree  Sequences 

For  an  undirected  graph  G  =  (V,  E ),  the  degree  da(v)  of  a  node  v  is  the  number  of 
edges  incident  to  v.  Associated  with  every  graph  G  is  a  sequence  d  =  {d( j(v)  :  v  e  V} 
of  its  degrees.  However,  not  every  sequence  of  nonnegative  integers  can  be  realized 
by  a  graph.  When  it  is  realizable,  the  sequence  is  called  graphic.  More  precisely,  a 
sequence  of  nonnegative  integers  d  =  (d1;  d2,  ■  ■  ■ ,  dn)  is  called  graphic  if  there  exists 
a  graph  G  with  nodes  v\,  v2,  ■  ■  ■ ,  vn  such  that  dcfig )  =  dt  for  i  =  1,2, ... ,  n.  The 
following  classical  theorem  of  Erdos  and  Gallai  [27]  characterizes  graphic  sequences. 

Theorem  2.2.1  (Erdos-Gallai;  Theorem  6.6  in  [11]).  A  sequence  of  nonnegative 
integers  d\  >  d2  >  ■  ■  ■  >  dn  is  graphic  if  and  only  if  Y^i=i  di  * s  even  and 

k  n 

y]  di  <  k(k  —  1)  +  min{dj,  k},  for  k  —  1,  2, . . . ,  n. 

i=  1  i=k-\- 1 

The  graphicality  of  a  sequence  can  be  tested  in  linear  time  using  the  Erdos-Gallai 
theorem  [40].  It  can  also  be  tested  using  the  constructive  Havel-Hakimi  algorithm  in 
0(n2logn)  time  [37,  36]. 
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For  a  digraph  G  =  ( V,E ),  a  node  has  both  an  in-degree  and  an  out-degree.  The 
in-degree  df(v )  of  a  node  v  is  the  number  of  directed  edges  coming  into  v,  and  the 
out-degree  (Iq(v)  is  the  number  of  directed  edges  going  out  of  v.  Associated  with  every 
digraph  G  is  a  bi-sequence  (d+,d~),  where  d+  =  {cIq(v)  :  v  G  V}  is  the  out-degree 
sequence  and  d~  =  {(Iq(v)  :  v  G  V}  is  the  in-degree  sequence.  As  in  the  undirected 
case,  not  every  bi-sequence  of  nonnegative  integers  can  be  realized  by  a  digraph.  A 
bi-sequence  of  nonnegative  integers  (d+,  d~)  =  {(df,  df,  ■  ■  ■ ,  df),  (df ,  df , . . . ,  d~ )}  is 
called  graphic  if  there  exists  a  digraph  G  with  nodes  iq,  rq, . . . ,  vn  such  that  dg(ry)  = 
df  and  df.(ry)  =  d~  for  i  =  1,2,  .  . . ,  n.  The  Fulkerson-Chen-Anstee  theorem  is  the 
analog  of  the  Erdos-Gallai  theorem  for  graphic  bi-sequences  [30,  19,  5]. 

Theorem  2.2.2  (Fulkerson-Chen-Anstee).  A  bi-sequence  {(d^, . . . ,  d+),  (df, . . . ,  d~ )} 
with  df>d£>--->d+  is  graphic  if  and  only  if  YH.=i  =  Y2i=  i  df  and 

k  k  n 

^df  <Y^  vdn{d-,  k  -  1}  +  minK")  k }>  fork  =  1,2,...,  n. 

i= 1  i= 1  i=k- (-1 

The  condition  of  the  Fulkerson-Chen-Anstee  theorem  can  be  tested  in  0(n2)  time. 
The  graphicality  of  bi-sequence  can  also  be  tested  using  the  constructive  Kleitman- 
Wang  algorithm  in  0(n2  log  n)  time  [48]. 

2.2.2  Maximum  Reciprocity  Problem 

In  this  subsection,  we  formulate  the  maximum  reciprocity  problem.  For  notational 
simplicity,  we  henceforth  make  no  distinction  between  a  graph  (digraph)  and  its  edge 
set  when  no  confusion  arises. 

Given  a  digraph  G,  let  Gs  be  the  symmetric  subgraph  of  G.  i.e.  (i,j)  €  Gs  if  and 
only  if  both  (i,j)  G  G  and  (j,i)  G  G.  The  reciprocated  edges  of  a  digraph  G  are 
precisely  those  of  Gs.  Thus  the  number  p(G)  of  reciprocated  edges  in  G  is  given  by 
p(G)  =  |Crs|,  and  the  reciprocity  of  G  is  r(G)  :=  p(G)/\G\.  Note  that  we  use  |G|  to 
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denote  the  number  of  edges  in  G  and  that  each  pair  of  reciprocal  edges  contributes 
two  to  p(G). 

Given  a  graphic  bi-sequence  (d+,d_),  let  C/(d+,d“)  denote  the  nonempty  set  of 
graphs  that  have  (d+,  d_)  as  their  degree  bi-sequence.  Since  the  total  number  of  edges 
is  fixed  for  a  given  graphic  bi-sequence,  maximizing  r(G)  is  the  same  as  maximizing 
p(G).  The  maximum  reciprocity  problem  is  then  to  find  a  digraph  G  in  C/(d+,d“) 
with  maximum  p(G),  i.e. 


maximize  p{G) 
subject  to  G  G  t/(d+,  d~). 

We  denote  the  maximum  value  by  p(d+,d~)  and  refer  to  a  digraph  G  with  p(G)  = 
p(d+,d~)  as  a  maximum  reciprocity  digraph  or  maximum  digraph  for  short. 

2.2.3  Some  Notations 

We  collect  here  some  notations  for  later  reference.  Let  G  denote  a  generic  digraph. 

•  Let  Ga  be  the  anti-symmetric  subgraph  of  G ,  i.e.  (i,j)  G  Ga  if  and  only  if 
(i,j)  €  G  but  (j,i)  G.  Note  that  G  =  Gs  +  Ga  and  Gs  D  Ga  =  0,  i.e.  G  is  the 
edge  disjoint  union  of  Gs  and  Ga. 

•  Let  Gu  be  the  undirected  graph  obtained  by  symmetrizing  G,  i.e.  (i,j)  G  Gu  if 
either  (i,j)  G  G  or  (j,i)  G  G. 

Let  (d+,d~)  be  a  graphic  bi-sequence. 

•  The  min  sequence  is 

d+  A  d~  =  (df  A  di  ,d£  A  d%  , _ ,  A  d~), 

where  a  A  b  =  rninja,  b}. 
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•  The  max  sequence  is 


d+  V  d”  =  (df  V  d{,  d%  V  dj, . . . ,  V  d~), 

where  a  V  b  =  max{a,  b}. 

•  The  total  number  of  edges  is 

e(d+,  d”)  =  dt  =  di- 

i  i 

•  The  total  balanced  degree  is 

/3(d+,d-)  =  ^i,+  Ad-, 

i 

which  is  the  f^-norm  of  the  min  sequence. 

•  The  total  unbalanced  degree  is 

z/(d+,d")  =  \^\dt 

i 

which  is  the  total  variation  distance  between  d+  and  d_.  Note  that  £(d+,  d~)  = 
/3(d+,  d~)  +  z/( d+,  d”). 

2.3  Hardness  Analysis  and  Bounds 

In  this  section,  we  first  provide  an  upper  bound  for  the  maximum  number  of 
reciprocated  edges  allowed  by  a  graphic  bi-sequence.  We  then  prove  that  the  maxi¬ 
mum  reciprocity  problem  is  NP-hard  by  showing  that  it  is  NP-complete  to  decide  the 
achievability  of  the  upper  bound.  Some  sufficient  conditions  for  achieving  the  upper 
bound  are  then  provided. 
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2.3.1  Upper  Bound  for  Reciprocity 

In  this  subsection,  we  first  establish  a  simple  upper  bound  on  the  maximum  num¬ 
ber  of  reciprocal  edges  in  terms  of  the  total  balanced  degree  of  the  graphic  bi-sequence, 
along  with  necessary  conditions  for  achieving  this  upper  bound.  Some  examples  are 
provided  to  illustrate  how  the  necessary  conditions  may  fail  and  that  they  are  not 
sufficient,  which  provides  insight  into  why  the  bound  is  not  always  tight. 

Proposition  2.3.1.  The  number  of  reciprocated  edges  in  any  digraph  with  a  given 
degree  bi-sequence  cannot  exceed  the  total  balanced  degree,  i.e. 

P(d+,d-)</j(d+.d-). 

A  necessary  condition  for  equality  is  that  both  d+  A  d  and  d+  V  d_  be  graphic. 

Proof.  Let  G  G  f/(d+,d“)  be  a  maximum  digraph.  Note  that  the  number  of  recip¬ 
rocated  edges  going  out  of  a  node  v  is  at  most  (Tq{v)  A  cIq(v).  The  desired  bound  is 
obtained  by  summing  over  v. 

If  equality  holds,  then  Gs,  as  an  undirected  graph,  has  degree  sequence  d+ Ad", 
and  Gu  has  degree  sequence  d+  V  d  .  Thus  both  d+  A  d“  and  d+  V  d~  are  graphic.  □ 

Note  that  it  is  possible  that  neither  d+  A  d"  nor  d+  V  d_  is  graphic.  In  fact,  one 
sequence  can  fail  to  be  graphic  independently  of  whether  the  other  is  graphic  or  not, 
as  illustrated  by  the  following  examples,  where  graphic  bi-sequences  are  shown  along 
with  the  corresponding  maximum  digraphs. 

Example  2.3.2.  In  Figure  2.1,  neither  the  min  sequence  d+  A  d”  nor  the  max  se¬ 
quence  d+  V  d_  is  graphic,  since  they  both  have  odd  sums.  Here  p(d+,d“)  =  2  < 
j3(d+,  d-)  =  3. 

Example  2.3.3.  In  Figure  2.2,  the  min  sequence  d+  A  d”  is  graphic,  while  the  max 
sequence  d+Vd”  is  not.  No  reciprocity  is  allowed  by  this  bi-sequence,  i.e.  p(d+,  d~)  = 
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df  A  d~ 

dt  V  d~ 

1 

(1.0) 

0 

1 

2 

(1.1) 

1 

1 

3 

(0.2) 

0 

2 

4 

(2,1) 

1 

2 

5 

(1.1) 

1 

1 

Figure  2.1:  Graphic  bi-sequence  with  non-graphic  max  and  min  sequences. 


0,  while  the  upper  bound  gives  /3(d+,d”)  =  2 n,  so  the  gap  can  be  arbitrarily  large. 
The  only  unbalanced  nodes  s  and  r  have  very  large  unbalanced  degrees  that  cannot  be 
absorbed  by  themselves,  as  a  consequence  of  which  some,  in  fact  all,  balanced  degrees 
have  to  be  used  for  absorbing  unbalanced  degrees  rather  than  forming  reciprocal  edges. 

i  (df,dr)  djAd~  djMdf 

s  (2 n,  0)  0  2 n 

1  ~  2  n  (1, 1)  1  1 

r  (0, 2 n)  0  2 n 

Figure  2.2:  Graphic  bi-sequence  with  graphic  min  sequence  but  non-graphic  max 
sequence. 


Example  2.3.4.  In  Figure  2.3,  the  max  sequence  d+  V  d  is  graphic,  while  the  min 
sequence  d+  Ad  is  not.  As  in  Example  2.3.3,  no  reciprocity  is  allowed  here,  i.e. 
p(d+,d”)  =  0,  while  the  upper  bound  is  /3(d+,d”)  =  2 n.  The  situation  is,  however, 
the  opposite.  Node  0  has  too  large  a  balanced  degree  relative  to  the  number  of  nodes 
with  nonzero  balanced  degrees,  which  is  one  here.  Thus  some  of  the  balanced  degrees 
have  to  be  absorbed  by  the  unbalanced  degrees. 
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i  (df,df) 

df  A  dj 

df  V  d~ 

si~s2n  (1,0) 

0 
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f  i  ~  r2n  (0, 1) 

0 

1 

0  (2 n,  2 n) 

2  n 

2  n 

Figure  2.3:  Graphic  bi-sequence  with  graphic  max  sequence  but  non-graphic  min 
sequence. 


The  common  pattern  in  Examples  2.3.3  and  2.3.4  is  that  there  are  a  small  number 
of  nodes  with  extremely  large  degrees.  In  the  social  network  context,  these  nodes 
correspond  to  celebrities  (node  r  in  Figure  2.3.3),  information  aggregators  (node  s  in 
Figure  2.3.3),  or  middlemen  (node  0  in  Figure  2.3.4).  These  large  degree  nodes  often 
incur  inevitable  reduction  of  reciprocity  from  the  upper  bound. 

The  next  example  shows  that  the  necessary  condition  in  Proposition  2.3.1  is  not 
sufficient. 

Example  2.3.5.  For  the  bi-sequence  (df,df)  =  (n  —  i,i),  i  —  0, 1, . . . ,  n,  the  upper 
bound  is  /3(d+,  d“)  =  |_n/2j  •  \n/ 2] .  When  n  is  a  multiple  of  f,  both  the  max  sequence 
d+  V  d  and  the  min  sequence  d+  A  d  are  graphic.  However,  p(d+,  d  )  =  0,  as  the 
only  digraph  in  f/(d+,d~),  of  which  ( i,j )  is  an  edge  if  and  only  if  i  <  j,  has  zero 
reciprocity;  see  Figure  2.f. 

2.3.2  Proof  of  NP-hardness 

We  saw  in  the  previous  subsection  that  the  upper  bound  may  not  be  achievable. 
Unfortunately,  the  next  theorem  shows  that  it  is  NP-complcte  to  decide  whether 
the  upper  bound  is  achievable,  which  means  the  maximum  reciprocity  problem  is 
NP-hard. 

Theorem  2.3.6.  The  decision  problem  whether  p(d+,  d~)  =  /3(d+ ,  d”)  is  NP-complete. 
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i 

(dt,d~)  dfAdr  d+VdJ 

0  ~  |n/2j 

(■ n  —  i,  i)  i  n  —  i 

\n/ 2]  ~  n 

(■ n  —  i,i)  n  —  i  i 

Figure  2.4:  Insufficiency  of  necessary  condition  in  Proposition  2.3.1. 


Proof.  Note  that  the  problem  is  the  same  as  the  existence  problem  of  a  digraph 
G  G  Q{ d+,  d~)  with  p(G)  =  /3(d+,  d~).  This  problem  is  in  NP,  since  given  a  digraph 
G,  we  can  verify  whether  p(G)  =  /3(d+,d~)  in  polynomial  time.  To  show  that  the 
problem  is  NP-hard,  we  adapt  the  proof  of  Lemma  5  in  [24]  by  reduction  from  the 
3-color  tomography  problem,  which  is  shown  to  be  NP-hard  therein. 

Recall  the  3-color  tomography  problem  is  as  follows.  Given  nonnegative  integral 
vectors  rw,  rb  G  Nn,  and  sw,  sb  G  Nm  that  satisfy 

rf  +  rb  <  m,  sj  +  sb  <  n ,  for  1  <  i  <  n,  1  <  j  <  m, 

and 

ri  =  SP  for  c  G  b }> 

*  3 

decide  if  (rw,  rb,  sw,  sb )  is  feasible,  i.e.  there  exists  a  matrix  M  with  entries  in  {tc,  b ,  g} 
such  that 

ri  =  I  {j  ■  Ma  =  c}\,  scj  =  I  {Mij  =  c}|,  for  c  G  {w,  b}. 

Let  (rw ,  rb ,  sw ,  sb)  be  an  n  x  m  instance  of  the  3-color  tomography  problem.  For 
1  <  i  <  n  and  1  <  j  <  m,  let 
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(2.1) 


4  =  if  +  l  +  n  -  1,  <+j  =  s”, 

d-  =  r»  +  „-i,  d;+J.  =  s7  +  4 


Now  we  show  that  the  instance  (rw ,  rb ,  sw ,  sb)  is  feasible  if  and  only  if  (d+,d~)  is 
graphic  and  p(d+,  d”)  =  /3(d+,  d_),  where  /3(d+,  d~)  =  n(n  —  1)  +  2  Y17=i  rT- 

First  assume  that  M  is  a  solution  to  the  3-color  tomography  instance.  We  con¬ 
struct  a  digraph  G  as  follows.  For  1  <  i  <  n  and  1  <  j  <  m,  let  Wl}  =  1  if  Mi3  =  w, 
and  Bij  =  1  if  M%3  =  b.  Let  J  be  an  n  x  n  matrix  with  all  off-diagonal  entries  equal 
to  1  and  diagonal  entries  equal  to  0.  Let  the  adjacency  matrix  of  G  be 


/ 


J 


W  +  B 


\ 


It  is  straightforward  to  verify  that  G  G  t/(d+,d~)  and  p(G )  =  /3(d+,d_). 

For  the  reverse  direction,  assume  that  (d+,  d~)  is  graphic  and  p(d+,  d~)  =  /3(d+,  d~). 
Then  there  exists  a  digraph  G  G  p(d+,d~)  with  p(G)  =  /3(d+,d_).  Divide  the  adja¬ 
cency  matrix  of  G  into  the  following  block  form 


G 


G  ii 
G2i 


G 12 

G22 


\ 


where  Gn  is  n  x  n  and  G22  is  m  x  m. 

Let  <f>  =  J2'j=idj  ~  Yl'iLi^n+ii  which,  by  (2.1),  evaluates  to  n(n  —  1).  On  the 
other  hand,  dj  =  Y^k=\  G(k,j)  and  d++i  =  Y^'iXT  G{n  +  i,k ),  so 

n  n  mm 

$  =  ~^2^2G22(i,j)  <  n(n  —  1)  =  $, 

i=  1  j= 1  i=  1  j= 1 
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where  the  inequality  follows  from  the  facts  that  Gu(i,j)  <  1,  Gn(i,i)  =  0  and 
G22(i,j)  >  0.  Since  the  equality  holds,  we  must  have  Gn  =  J  and  G22  =  0.  Thus 

n  m 

p(G)  =  n(n  —  1)  +  2  ^  ^  Gi2(i,  j)G2i(j,  i ) 

i=  1  j= 1 
n  m 

<  n(n  -  !)  +  2  ^  ^  G'21  (j,  i) 
i=  1  j=l 

m 

=  n{n  -  1)  +  2  d++j  =  /3(d+,  d")  =  p(G). 
j=i 

Since  the  equality  holds,  Gi2(i,j)  >  G'2i  (j,  i)-  Thus  G\2  =  W  -\-B  and  G2i  =  WT  for 
some  (0,  l)-matrices  W  and  B.  Let  My  =  w  if  W(i,j)  =  1,  and  Ml3  =  b  if  Bl3  =  1. 
Then  M  is  a  solution  to  the  3-color  tomography  instance. 

Thus  we  have  shown  that  it  is  NP-complete  to  decide  whether  (d+,  d”)  is  graphic 
and  p(d+,d~)  =  /3(d+,d“).  Since  the  graphicality  of  (d+,d_)  can  be  tested  in 
quadratic  time  using  the  Fulkerson- Chen- Anstee  theorem,  it  rnnst  be  NP-complete  to 
decide  whether  p(d+,d~)  =  /3(d+,d~).  □ 

2.3.3  Sufficient  Conditions  for  Achieving  Upper  Bound 

Given  the  hardness  of  the  maximum  reciprocity  problem,  we  provide  some  suffi¬ 
cient  conditions  for  achieving  the  upper  bound  in  Proposition  2.3.1.  We  start  with  the 
following  slightly  more  general  theorem,  which  may  be  used  to  lower  bound  p(d+,  d“). 

Theorem  2.3.7.  p(d+,d~)  >  2m  if  there  exists  a  sequence  d°  such  that 

(1) .  d°  is  graphic  with  df  >  m, 

(2) .  the  residual  bi-sequence  (d+  —  d°,d~  —  d°)  is  also  graphic, 

(3) .  A  <  yj Sn  +  -  \)2  +  |  -  5,  where  n  =  |Vo|,  A  =  \/iGVo(dl  +  d~  -  d°)  and 

5  =  Aievb  (dt  +  d~  -  d(f),  with  V0  =  {i  :  df  V  d~  >  0}. 
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This  theorem  is  analogous  to  Theorem  2.2  in  [16],  which  deals  with  packing  two 
graphic  sequences  for  undirected  graphs.  Theorem  2.3.7  deals  with  packing  a  graphic 
sequence  d°  for  undirected  graphs  and  a  graphic  bi-sequence  (d+  —  d°,  d"  —  d°)  for 
digraphs.  The  proof  is  deferred  to  Appendix  A.l. 

Applying  Theorem  2.3.7  with  d°  =  d+  A  d_,  we  obtain  the  following  sufficient 
conditions  for  achieving  the  upper  bound  in  Proposition  2.3.1. 

Corollary  2.3.8.  p(d+,d~)  =  /3(d+,d~)  if  the  following  conditions  hold, 

(1) .  d+  A  d~  mid  (d+  —  d+  A  d~,  d~  —  d+  A  d_)  are  graphic; 

(2) .  A  <  5n  +  (8  —  |)2  +  |  —  6,  where  n  =  \V0\,  A  =  \/ieVo(df  V  d~)  and  6  = 

AieVo  idt  v  df)>  with  Vo  =  {i'-  df  V  d~  >  0}. 

Note  that  A  is  the  maximum  of  either  the  in-  or  out-degrees.  Putting  an  upper 
bound  on  A  rules  out  extremely  large  degrees,  which  are  the  trouble  makers  in  the 
examples  of  Section  2.3.1.  However,  in  most  real  networks,  we  have  8  —  1,  so  the 
sufficient  condition  essentially  requires  A  <  y/n,  which,  unfortunately,  usually  fails 
to  hold.  In  fact,  it  fails  for  most  networks  studied  in  Section  2.5. 

2.4  Patterns  in  Maximum  Digraphs 

In  this  section,  we  identify  some  structural  patterns  of  maximum  digraphs,  or 
equivalently,  the  associated  suboptimal  structures  that  contribute  to  the  loss  in  reci¬ 
procity  that  is  not  imposed  by  the  degree  bi-sequence.  We  first  look  at  some  small 
suboptimal  motifs  and  provide  a  greedy  algorithm  to  eliminate  them.  We  then  show 
some  more  complicated  structural  patterns  of  maximum  digraphs  and  demonstrate 
how  they  can  help  us  pin  down  the  maximum  digraphs  in  some  special  cases. 

Throughout  this  section,  a  cycle  or  a  path  always  refers  to  a  directed  cycle  or 
directed  path,  i.e.  the  edges  must  be  all  in  the  same  direction  as  we  follow  the  cycle 
or  path.  We  also  require  that  the  edges  be  distinct.  On  the  other  hand,  the  vertices 
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Vo  - X— ►  V!  Vo  - ►  fl 

t'' 

'  i 

1  i 

1  l 

;  w 

v3  -«-X —  v2  v3  + - v2 

(c)  Type  III.  (d)  Type  IV. 

Figure  2.5:  Different  types  of  3-paths  with  corresponding  rewirings.  The  edges  marked 
by  red  crosses  are  to  be  rewired  into  the  dashed  green  edges. 

are  not  necessarily  distinct.  When  the  vertices  are  distinct,  we  say  the  path  or  cycle 
is  elementary. 

2.4.1  Small  Suboptimal  Motifs 

In  this  subsection,  we  focus  on  a  particular  type  of  small  motifs  that  we  call  3- 
paths,  the  nonexistence  of  which  also  guarantees  the  nonexistence  of  many  larger  scale 
suboptimal  structures.  As  we  will  see  in  Section  2.5,  elimination  of  such  suboptimal 
motifs  brings  reciprocity  close  to  the  corresponding  upper  bound  for  a  variety  of  real 
world  networks. 

Given  a  digraph  G,  we  call  an  elementary  path  of  length  3,  7 r  =  {vq,vi,v2,v3),  a 
3-path  if  ( Vi ,  vi+i)  G  Ga  for  i  —  0, 1,  2,  i.e.,  7r  consists  entirely  of  unreciprocated  edges. 
We  further  classify  3-paths  into  the  following  four  types  according  to  the  connectivity 
between  v0  and  v3, 

(I).  (vo,v3)  ^  Gu,  i.e.  there  is  no  edge  between  Vo  and  v3 ; 

(II).  (r;0,  v3)  G  Gs ; 

(III) .  (v3,v0)  G  Ga,  i.e.  (v0,v1,v2,v3,v0)  is  a  4-cycle; 

(IV) .  (r>cb  ^3 )  £  Ga. 


Vo  — Vi 


v3  —  v2 


(a)  Type  I. 


Vo 


v3 


Vi 

X 

v2 


(b)  Type  II. 
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As  shown  in  Figure  2.5,  3-paths  of  Types  I,  II  and  III  are  suboptimal  and  can 
be  rewired  locally  to  increase  reciprocity.  We  say  a  digraph  is  3-path  optimal  if  it 
has  no  3-path  of  Type  I,  II  or  III.  Note  that  when  viewed  as  a  transformation  on 
Ga,  the  rewiring  procedure  in  Figure  2.5  simply  eliminates  4-cycles  (Type  III),  and 
replaces  open  3-paths  by  a  shortcut  from  its  first  vertex  to  its  last  vertex  if  such  a 
shortcut  does  not  yet  exist  (Types  I  and  II).  Thus  each  rewiring  increases  the  number 
of  reciprocated  edges  by  either  2  or  4,  and  we  have  the  following 

Lemma  2.4.1.  A  maximum  digraph  is  3-path  optimal. 

Given  a  digraph  G,  we  can  greedily  rewire  all  3-paths  to  get  a  lower  bound  on  the 
maximum  reciprocity  allowed  by  the  degree  bi-sequence  of  G.  The  resulting  greedy 
algorithm  is  shown  in  Algorithm  1. 


Algorithm  1  GreedyRewire 

Input:  G  =  (V,  E) 

1:  S<-V 
2:  while  S'  7^  0  do 
3:  pick  Vo  £  S 

4:  if  there  exists  non- Type  IV  3-path  n  =  (vo,vi,V2,vs)  then 

5:  G  <—  Rewire(7r) 

6:  S  i —  S  U  {vi,v2} 

7:  else 

8:  S  «-  S  -  {u0} 

9:  end  if 

10:  end  while 
11:  return  G 


Proposition  2.4.2  guarantees  that  Algorithm  1  eliminates  all  3-paths  of  Types  I, 
II  and  III.  The  proof  is  deferred  to  Appendix  A. 2. 

Proposition  2.4.2.  Algorithm  1  returns  a  3-path  optimal  digraph. 

Note  that  depending  on  how  ty  and  n  are  picked,  Algorithm  1  can  return  different 
3-path  optimal  graphs.  Although  there  is  no  theoretical  guarantee,  we  will  see  in 
Section  2.5  that  reciprocities  of  3-path  optimal  digraphs  returned  by  Algorithm  1  are 
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very  close  to  the  corresponding  upper  bounds  and  hence  close  to  the  maxima  as  well. 
The  next  subsection  shows  that  3-path  optimality  precludes  many  other  suboptimal 
structures,  which  partially  explains  why  Algorithm  1  works  pretty  well  in  practice. 

2.4.2  Properties  of  Maximum  Digraphs 

In  this  subsection,  we  consider  additional  suboptimal  structures  that  are  more 
complicated  than  3-paths.  Some  of  these  structures  are  automatically  eliminated 
by  Algorithm  1,  while  others  require  extra  attention.  We  will  state  the  results  as 
properties  of  maximum  digraphs.  Any  violation  of  the  stated  properties  yields  a 
suboptimal  structure. 

2.4. 2.1  3-path  Optimal  Digraphs 

We  first  consider  some  properties  of  3-path  optimal  digraphs,  which,  by  Lemma  2.4.1, 
are  also  properties  of  maximum  digraphs.  All  these  properties  involve  only  unrecip¬ 
rocated  edges.  Note  that  any  suboptimal  structures  that  violate  these  properties 
are  automatically  eliminated  by  Algorithm  1.  Let  G  denote  a  3-path  optimal  digraph 
throughout  this  subsection.  Recall  that  in  a  3-path  optimal  digraph,  the  only  possible 
3-path  is  of  Type  IV. 

Lemma  2.4.3  shows  that  the  unreciprocated  edges  of  a  3-path  optimal  digraph 
cannot  form  any  elementary  path  of  odd  length  without  a  shortcut.  As  a  result,  for 
any  two  vertices  u  and  v,  either  there  is  no  path  from  u  to  v  in  Ga,  or  there  is  such 
a  path  of  length  at  most  2. 

Lemma  2.4.3.  If  tt  =  (vo,v\, . . . ,  v-2P+  i  )  is  an  elementary  path  of  odd  length  in  Ga, 
then  Oo,u2p+i)  e  Ga. 

Proof.  We  use  induction  on  p.  If  p  =  0,  then  (vo,vi)  E  Ga  by  assumption.  If  p  —  1, 
then  7r  is  a  3-path  of  Type  IV  and  hence  ( v0 ,  v3)  E  Ga.  Now  consider  p  >  2.  We  have 
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(u0,  V2P-\)  G  Ga  by  the  induction  hypothesis.  Then  (v0,  u2p_i,  v2p,  u2p+i)  is  a  3-path 
of  Type  IV.  Thus  (■ v0,v2p+i )  G  Ga.  □ 

Lemma  2.4.4  shows  that  the  anti-symmetric  subgraph  of  a  3-path  optimal  digraph 
is  almost  cycle  free.  We  can  obtain  a  directed  acyclic  graph  from  it  by  removing  an 
edge  from  each  3-cycle. 

Lemma  2.4.4.  The  only  possible  cycles  in  Ga  are  3-cycles,  and  any  two  distinct 
3-cycles  must  be  vertex  disjoint. 

Proof.  We  first  prove  that  two  distinct  3-cycles  must  be  vertex  disjoint  by  contradic¬ 
tion.  Suppose  they  share  at  least  one  vertex  vq.  Let  the  cycles  be  Co  =  (vo,vi,V2,vo) 
and  C i  =  (v0,v3,V4,v0).  Note  that  v\  ^  vA  and  v2  ^  v3,  as  all  edges  are  in  Ga.  Since 
Co  and  C\  are  distinct,  we  must  have  either  v\  ^  v3  or  v2  ^  v±.  Without  loss  of  gen¬ 
erality,  assume  v\  ^  v3.  Then  (vi,v2,Vo,v3)  is  a  3-path  of  Type  IV,  so  (vi,v3)  G  Ga- 
But  then  (vi,  v3,  v±,  v0)  is  a  3-path  of  Type  III,  which  is  impossible.  Therefore,  C0 
and  C\  must  be  vertex  disjoint. 

Next  we  prove  there  are  no  elementary  k- cycles  for  k  >  4.  Suppose  there  is 
such  a  cycle  (v0,  . . . ,  Vk-i,  u0).  If  k  is  even,  (ui,ffc_2)  G  Ga  by  Lemma  2.4.3. 
But  (vo,vi,Vk-2,Vk-i)  is  a  3-path  of  Type  III,  which  is  impossible.  If  k  is  odd, 
then  (uoWfc-2),  (ui,Ufc_i)  G  Ga  again  by  Lemma  2.4.3.  But  then  (v0,  Vi,  v^-i,  v0)  and 
(vo,Vk-2,Vk-i,vo)  are  two  distinct  3-cycles  with  two  common  vertices,  which  is  again 
impossible. 

Finally,  suppose  there  is  a  non-elementary  cycle.  We  can  be  decompose  it  into 
several  distinct  elementary  cycles,  all  of  which  must  be  3-cycles  by  the  previous  para¬ 
graph.  But  then  we  have  distinct  3-cycles  that  are  not  vertex  disjoint,  which  is 
impossible.  Therefore,  there  are  no  k- cycles  for  k  >  4.  □ 

Although  3-path  optimality  does  not  preclude  3-cycles,  they  are  unlikely  to  exist 
in  3-path  optimal  graphs  obtained  from  real  world  networks  using  Algorithm  1,  as 
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Lemma  2.4.5  requires  that  the  vertices  of  a  3-cycle  in  such  graphs  have  exactly  the 
same  connectivity  to  every  vertex  outside  the  3-cycle,  which  is  extremely  unlikely, 
especially  in  large  graphs. 

Lemma  2.4.5.  For  a  3-cycle  C  in  Ga  and  any  vertex  v  not  in  C ,  if  there  is  a  path  i r 
in  Ga  that  connects  v  and  C ,  then  there  is  an  edge  of  Ga  between  v  and  each  vertex 
of  C,  all  in  the  same  direction  as  i r  (i.e.  from  v  to  C  or  from  C  to  v). 

Proof.  Let  C  =  (u0,  iq,  i>2,  u0).  Without  loss  of  generality,  assume  tt  is  from  v  to  v0  and 
has  odd  length.  Successive  application  of  Lemma  2.4.3  to  the  paths  tt,  (v,vq,vi,V2) 
and  then  (v,  v2,  v0,  iq),  we  obtain  (v,v0)  G  Ga,  (u,u2)  G  Ga  and  (u,tq)  G  Ga  in  the 
same  order.  □ 

2. 4. 2. 2  Maximum  Digraphs 

In  this  subsection,  we  consider  some  properties  of  maximum  digraphs  that  are 
not  direct  consequences  of  3-path  optimality.  The  associated  suboptimal  structures 
may  be  left  intact  by  Algorithm  1  and  require  extra  attention.  Throughout  this 
subsection,  let  G *  denote  a  maximum  digraph  with  a  given  bi-sequence  (d+,d_),  i.e. 
G*  G  0(d+,  d~)  and  p(G*)  =  p{ d+,  d"). 

We  know  from  Lemma  2.4.4  that  large  cycles  involving  only  unreciprocated  edges 
are  suboptimal  structures,  but  certain  cycles  of  even  length  that  contains  reciprocated 
edges  are  also  suboptimal.  In  particular,  we  have  the  following 

Lemma  2.4.6.  Let  C  be  an  even  cycle  in  H  G  f/(d+,d”).  If  any  two  edges  m  C fl  Hs 
are  separated  by  an  odd  number  of  edges  in  C ,  then  there  exists  H'  G  f?(d+,d~)  with 
p(H' )  =  p(H)  +  \Ca\  —  2\ Ca  D  Hs\,  where  Ca  is  the  anti- symmetric  part  of  C ,  i.e. 
Ca  =  {(i,j)eC-.(j,i)tC}. 

Note  that  C  fl  Ha  C  Ca  but  it  is  not  necessarily  true  that  Ca  =  C  fl  Ha.  The  two 
edges  (3, 4)  and  (5, 0)  in  Figure  2.6(a)  are  in  Ca  but  not  in  CnHa.  Any  cycle  satisfying 
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D 


-  4 

(b) 


Figure  2.6:  Suboptimal  even  cycle  with  reciprocated  edges.  Reciprocity  can  be  in¬ 
creased  by  rewiring  the  edges  marked  by  red  crosses  into  the  dashed  green  edges. 


the  conditions  in  Lemma  2.4.6  is  suboptimal  if  it  has  more  anti-symmetric  edges  than 
symmetric  ones.  The  cycles  (0, 1,  2, 3, 4,  5,  0)  in  Figure  2.6(a)  and  (0, 1,  2,  0,  5, 3, 4,  5,  0) 
in  Figure  2.6(b)  are  two  such  examples.  Note  that  these  two  cycles  are  not  automat¬ 
ically  eliminated  by  Algorithm  1. 

Proof  of  Lemma  2.4-6.  Let  C  =  (z>o> ih,  •  •  •  ,v2p-i,v2p  =  r’o),  where  the  vertices  are 
labeled  such  that  (v2p-i,v0)  G  Hs  if  C  fl  Hs  ^  0.  Note  that  the  vertices  may  not  be 
distinct.  Note  also  that  we  must  have  (v2k,v2k+i)  G  Ha  for  all  k.  If  (v2k,v2k+i)  £  Hs 
for  some  k,  then  the  number  of  edges  in  C  between  (v\,v2)  and  ( v2k,v2k+i )  would  be 
2 k  —  2,  contradicting  the  assumption  that  any  two  edges  in  C  fl  Hs  are  separated  by 
an  odd  number  of  edges  in  C.  As  illustrated  in  Figure  2.6,  let 

H'  =  H  —  {(y2i_i,v2i)Yi=l  +  {(v2i-i,v2i-2)}?=1. 

Since  (v2l-2,  v2l-i)  €  Ha,  we  have  (v2i_ i,v2i^2)  ^  H  and  hence  H'  e  ty(d+,d~).  Note 
that  the  edges  in  C  fl  II „  are  either  absent  from  H'  or  in  H's,  so  Ha  —  H'a  =  C  D  Ha  = 
Ca  fl  //„.  On  the  other  hand,  all  edges  in  C  fl  Hs  are  removed  from  //'.  so  H'a  —  Ha  = 
Ca  fl  Hs.  Thus  by  going  from  H  to  H',  we  eliminated  \CaP\  Ha\  unreciprocated  edges 
while  creating  \CaP\  Hs\  new  ones.  Therefore,  p(H')  =  p{H )  —  \Can  Hs  \  +  \CaP\  Ha\. 
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Since  \Ca\  =  \Ca  fl  Hs\  +  \Ca  Pi  Ha\,  the  desired  conclusion  follows  by  eliminating 
\CaDHa\.  □ 

Lemma  2.4.7  specifies  how  multiple  3-cycles  should  be  connected  in  maximum 
digraphs.  If  we  collapse  each  3-cycle  into  a  single  vertex  by  contracting  its  edges, 
the  subgraph  of  G *  induced  by  these  vertices  will  have  the  structure  in  Figure  2.4. 
Therefore,  while  the  existence  of  multiple  3-cycles  is  already  very  unlikely  in  3-path 
optimal  digraphs,  it  is  even  less  likely  in  maximum  digraphs  with  degree  bi-sequences 
of  real  world  networks. 

Lemma  2.4.7.  The  set  of  all  distinct  3-cycles  of  G*  can  be  linearly  ordered  as 
Co,  C\,. . .  ,Cm  such  that  there  are  9  edges  of  G*  going  from  Ci  to  Cj  for  all  0  < 
i  <  j  <  m. 

Proof.  Consider  two  distinct  3-cycles  C  =  (vo,  Vi,  i>2,  Vo)  and  C  =  (wo,wi,W2,wo). 
There  cannot  exist  a  pair  of  edges  from  G *  that  connect  C  and  C';  otherwise, 
say  (vo,wo)  G  G*,  the  cycle  (u0,  v\,  V2,  vq,  wq,  w\,  W2,  wq,  vq)  would  be  suboptinml  by 
Lemma  2.4.6.  On  the  other  hand,  there  must  be  at  least  one  edge  between  C  and  C’\ 
otherwise,  replacing  Ci  and  Cj  by  the  three  pairs  of  edges  {(uj,-uy),  (u!j,  Uj)}?=0  would 
increase  the  reciprocity.  Without  loss  of  generality,  assume  (vo,wo)  G  G*.  It  then  fol¬ 
lows  from  Lemma  2.4.5  that  (ig,  Wj)  G  G*a  for  all  i ,  j  G  {0, 1,  2}.  By  Lemma  2.4.4,  such 
edges  cannot  be  part  of  any  cycle.  Therefore,  we  can  sort  the  3-cycles  topologically 
and  label  them  in  the  desired  way.  □ 

The  next  lemma  complements  Lemma  2.4.3  by  specifying  connection  patterns  of 
elementary  paths  of  even  length. 

Lemma  2.4.8.  Let  n  =  (■ v0 ,  Vi, . . . ,  u2p)  be  an  elementary  path  of  even  length  2p  >  4 
in  G*a,  E0  =  {(v2i,v2 j)  :  i  ±  .]}  and  Ex  =  {(v2i- i,v2j-i)  :  i  ±  j}.  If  (v0,v2 P)  £  G*a, 
then  G *  either  has  all  the  edges  in  E0  but  none  in  E\,  or  vice  versa. 
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(a)  E0  C  G*,  E1  n  G*  =  0. 


(b)  Ex  c  G*,  Eq  n  G*  =  0. 


Figure  2.7:  Patterns  of  even  paths  in  maximum  digraphs.  Each  undirected  solid  edge 
represents  a  pair  of  reciprocated  edges  in  G*.  Each  dashed  edge  represents  a  pair  of 
edges  that  are  both  missing  in  G*. 

Proof.  See  Appendix  A. 3.  □ 

Figure  2.7  shows  both  possibilities  for  an  elementary  path  of  length  4.  The  short¬ 
cuts  required  by  Lemma  2.4.3  are  also  shown.  The  red  dashed  edges  represent  those 
that  cannot  coexist  with  the  green  edges  in  a  maximum  digraph.  Some  suboptimal 
structures  that  violate  Lemma  2.4.8  cannot  be  automatically  eliminated  by  Algo¬ 
rithm  1.  For  example,  if  the  pair  of  edges  between  the  vertices  0  and  2  are  missing 
from  Figure  A. 4,  the  resulting  suboptimal  digraph  will  be  left  intact  by  Algorithm  1. 

2.4.3  Some  Examples 

In  this  subsection,  we  illustrate  how  the  structural  patterns  of  the  previous  sub¬ 
section  may  be  used  to  pin  down  the  maximum  digraph  in  some  special  cases.  Here 
G*  always  denotes  a  maximum  digraph. 

Proposition  2.4.9  shows  that  when  the  bi-sequence  is  perfectly  balanced,  the  max¬ 
imum  digraph  achieves  perfect  or  near-perfect  reciprocity.  Therefore,  any  unfulfilled 
reciprocity  must  be  due  to  the  lack  of  effort  to  form  reciprocal  edges  rather  than  due 
to  the  fundamental  limit  imposed  by  the  bi-sequence  itself. 

Proposition  2.4.9.  Suppose  (d+,d~)  is  perfectly  balanced,  i.e.  z/(d+,d~)  =  0. 

(1) .  //£(d+,d-)  is  even,  then  p(d+,d  )  =£(d+,d  ). 

(2) .  Ife(d+,d~)  is  odd,  then  p(  d+,d~)  =  e(d+,d-)— 3,  andG*  consists  of  a  3-cycle. 
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Proof.  Since  z/(d+,  d_)  =  0,  we  have  df  =  df  for  all  i.  Thus  any  edge  of  G*  must  be 
contained  in  a  cycle  of  length  at  least  3  in  G*.  By  Lemma  2.4.4,  the  length  of  such  a 
cycle  is  exactly  3.  By  Lemma  2.4.7,  there  is  at  most  one  such  cycle  in  G*.  Thus  G* 
is  either  empty  or  a  3-cycle.  Since  p{G*)  must  be  even,  the  former  case  corresponds 
to  even  e(d+,  d”)  and  the  latter  odd  e(d+,  d_).  □ 

The  next  proposition  shows  that  when  the  bi-sequence  is  slightly  unbalanced,  the 
number  of  possible  values  of  p(d+,d~)  increases.  This  sheds  some  light  on  why  the 
maximum  reciprocity  problem  is  so  difficult.  As  the  total  unbalanced  degree  increases, 
the  number  of  possibilities  is  expected  to  explode. 

Proposition  2.4.10.  Suppose  (d+,d~)  is  slightly  unbalanced  with  z/(d+,d~)  =  1, 
dp  —  dg  =  1  and  df  —  df  =  1 . 

(1) .  J/e(d+,d")  is  even,  then  the  gap  e(d+,d  )  —  p(d+,  d  )  is  either  2  or  f.  When 

the  gap  is  2,  the  two  edges  in  G*  form  a  2-path  from  0  to  1.  When  the  gap  is 
4,  G*  is  the  vertex  disjoint  union  of  {(0, 1)}  and  a  3-cycle. 

(2) .  Ife{ d+,d")  is  odd,  then  the  gap  e(d+,d  )  —  p(d+,d  )  is  either  1  or  5.  When 

the  gap  is  1,  G*  =  {(0, 1)}.  When  the  gap  is  5,  G *  is  the  vertex  disjoint  union 
of  a  2-path  from  0  to  1  and  a  3-cycle. 

Proof.  Note  that  there  must  be  a  path  from  0  to  1  in  G*.  Let  n  be  the  shortest  path 
from  0  to  1  in  G*.  All  edges  in  G *  —  n,  if  there  is  any,  must  be  contained  in  a  cycle  in 
G*.  By  Lemma  2.4.4,  G *  can  only  have  3-cycles.  If  G*  had  more  than  one  3-cycles, 
Lemma  2.4.7  would  require  that  there  be  at  least  9  edges  in  G*  that  are  not  contained 
in  any  cycle,  all  of  which  must  be  in  tt.  Lemma  2.4.3  shows  that  n  has  either  one  or 
two  edges.  Therefore,  G*a  —  n  is  either  empty  or  has  one  3-cycle.  By  Lemma  2.4.5, 
tt  and  the  3-cyclc,  if  there  is  one,  must  be  vertex  disjoint.  Since  1 7r|  G  {1,2},  and 
|G*  — 7r|  G  {0,  3},  it  follows  that  e(d+,  d~)  —  p(G*)  =  |G*|  =  1 7r|  +  |G*  —  tt\  <  2  +  3  =  5. 


27 


d 


e 


(a)  G. 
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(b)  7 r  and  G*  —  ir. 


Figure  2.8:  Example  2.4.11. 

Note  that  p(G*)  is  even.  If  e(d+,  d~)  is  even,  then  e(d+,  d-)—  p(G*)  is  equal  to  |7r|  =  2 
or  a(G*)  =  1 7r |  +  | Ga  —  7r|  =  1  +  3  =  4.  If  e(d+,  d~)  is  odd,  then  e(d+,  d~)  —  p(G*)  is 
equal  to  1 7r|  =  1  or  |7r|  +  | G*  —  7r|  =  2  +  3  =  5.  □ 

It  is  easy  to  come  up  with  examples  where  the  gaps  are  1  and  2,  respectively.  The 
next  examples  shows  that  the  other  two  cases  are  also  possible. 

Example  2.4.11.  Let  (d+,d_)  =  {(1,  3,  2,  2, 2),  (0, 4,  2,  2,  2)}.  Figure  2.8(a)  shows 
a  realization  G  of  this  bi-sequence,  where  each  undirected  edge  represents  a  pair  of 
edges  in  opposite  directions.  Note  that  p{G)  =  e(d+,d-)  —  4.  We  claim  that  p(G)  = 
p(d+,d~).  If  not,  then  p(G*)  =e(d+,d”)  — 2  by  Proposition  2.f.  10,  and  the  two  edges 
in  G *  form  a  2-path  i r  from  a  to  b.  Since  c,  d,  e  have  the  same  in-  and  out- degrees 
and  hence  are  equivalent,  we  may  assume  without  loss  of  generality  that  7 r  =  (a,  c,  b ) . 
Thus  G*  —  Ti  is  symmetric  and  corresponds  to  a  simple  graph  with  degree  sequence 
d  =  {0,3, 1,2,2}.  There  is  only  one  simple  graph  with  this  degree  sequence,  which  is 
shown  by  the  black  edges  in  Figure  2.8(b).  When  we  superimpose  7 r  and  G*  —  7 r,  there 
are  two  edges  from  (c,b),  and  hence  G*  ^  C/(d+,d~),  a  contradiction. 

Example  2.4.12.  Let  (d+,  d~)  =  {(1,  0, 4, 2,  2,  2),  (0, 1, 4,  2,  2,  2)}.  Figure  2.9  shows 
a  realization  G  of  this  bi-sequence,  where  each  undirected  edge  represents  a  pair  of 
edges  in  opposite  directions.  Note  that  p(G)  =  e(d+,d“)  —  5.  Since  the  sequence 
d+  Ad  =  {0,  0, 4,  2,  2,  2}  is  not  graphic,  Proposition  2.3.1  shows  that  p(d+,d  )  < 
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e(d+,d  )  —  1.  Thus  Proposition  2-4-10  yields  p(G)  =  p(d+,d  ).  In  fact,  G  is  the 
only  element  ofQ( d+,d"). 


a  e 


Figure  2.9:  Example  2.4.12. 


2.5  Empirical  Study 

In  this  section,  we  conduct  an  empirical  analysis  of  real  networks  by  comparing 
the  observed  values  of  reciprocity  against  the  upper  bounds.  We  also  look  at  the 
lower  bounds  on  maximum  reciprocities  given  by  Algorithm  1. 

2.5.1  Datasets 

The  networks  that  we  analyze  include  major  online  social  networks  (OSN)  that 
are  directed  in  nature  [56,  51,  35,  77,  52],  For  the  purpose  of  comparison,  we  have 
also  included  other  types  of  networks:  biological  networks  [73,  74,  68,  76,  70,  80], 
communication  networks  [52],  product  co-purchasing  networks  [52],  web  graphs  [52], 
Wikipedias  [1],  software  call  graphs  [75,  66],  and  P2P  networks  [52],  All  the  datasets 
except  for  Wikipedias  are  already  converted  into  graph  representations  by  other  re¬ 
searchers  and  the  descriptions  for  the  datasets  can  be  found  at  the  cited  sources.  For 
Wikipedias,  each  node  represents  a  page.  Only  article  pages,  i.e.  pages  with  names¬ 
pace  ID  0,  are  included.  Pages  that  redirect  to  the  same  page  are  represented  as  a 
single  node  corresponding  to  the  destination  page.  There  is  an  edge  from  node  A 
to  node  B  if  there  is  at  least  one  hyperlink  from  page  A  to  page  B.  Multiple  edges 
and  self-loops  are  discarded.  Some  basic  statistics  of  the  networks  can  be  found  in 
Appendix  B. 


29 


0, 

0, 

0, 

>*n 

0, 

o 

.1° 

cdO, 

0, 

0, 

0, 


1 

.9  - 
.8  - 
7  - 
.6 

,5  b 
A 
,3  b 
2 
,1  b 


0 


— 

y=1  .lx-0.21 

• 

biological 

• 

communication 

• 

co-purchase 

social 

• 

web 

• 

wikipedia 

• 

software  call 

• 

p2p 

Slashdot081 1 
Slashdot0902 
YouTube 


/ 


/ 


/ 


Dutch 


Google+  / 

/•Google 

Stanford—^ 

email^eArbarf  ' 

•Swedisp'  ego-Google+ 

wiki-Talk*  ,  ^  *Spanish 


Italian— 

Polish^-  p( 
NotreDame*  ^us^hsh 

r-Fre 

/  ''©Japanese 
/Ger^anEPinions1 
Twitter 


/ 

LiveJournall  / 
LiveJournal  / 

/ 

/  mouse-cortex 
Ohinese^iickr 
.  /Korean 


. Pokec 


Portuguese*'  ego-Twitter 
/French 


/ 


/ 


W- 

0 


east 


^yeas 


/ 

/ 

*  'itackOF 

-4- — s  5 - 


•C.EIegan 

•  protein-interaction 


wiki-Vote 

—  a- 


0.2 


0.4  0.6 

upper  bound 


0.8 


Figure  2.10:  Scatter  plot  of  empirical  reciprocity  versus  upper  bound.  Regression  line 
was  fitted  without  data  points  for  biological,  P2P  and  software  call  networks. 


2.5.2  Empirical  Reciprocity  vs.  Upper  Bound 

Figure  2.10  shows  the  scatter  plot  of  empirical  reciprocities  against  the  correspond¬ 
ing  upper  bounds.  Here  the  upper  bound  is  normalized  by  the  number  of  edges,  i.e., 
it  is  the  ratio  /3(d+,  d~)/£(d+,  d_).  Note  that  the  reciprocity  values  vary  widely, 
ranging  from  0  for  the  peer-to-peer  network  Gnutella  to  90%  for  the  online  social 
network  Slashdot.  There  is  even  a  fair  amount  of  variation  within  the  categories  of 
biological,  social  and  Wikipedia  networks.  In  general,  social  networks  and  Wikipedia 
networks  tend  to  have  high  reciprocity,  while  software  call  networks  tend  to  have 
low  reciprocity.  Note  the  strong  linear  correlation  between  empirical  reciprocity  and 
the  upper  bound.  This  is  a  little  bit  surprising,  especially  for  the  social  networks, 
in  view  of  the  large  variations  in  reciprocity.  Related  to  Figure  2.10  is  the  scatter 


30 


108 

tt> 

a> 

0) 

■o  R 
©  106 

■a 

0 

■4—' 

05 

2  104 
Q. 

O 

0 

^  IQ2 


10' 


—  y=0.14x109 

•  biological 

•  communication 

•  co-purchase 
social 

•  web 

•  wikipedia 

•  software  call 


Stackoverflow 


v  . 


•  . 

•m  © 


101 


10J  10°  10' 

upper  bound 


10‘ 


Figure  2.11:  Scatter  plot  of  number  of  reciprocated  edges  versus  upper  bound.  Re¬ 
gression  line  was  fitted  in  log  scale,  without  data  points  for  software  call  networks. 


plot  in  Figure  2.11  of  number  of  reciprocated  edges  against  the  unnormalized  bound 
/3(d+,d~).  There  the  linear  relationship  in  log-log  scale  is  even  more  apparent,  with 
biological  networks  being  also  around  the  regression  line.  These  linear  relationships 
suggest  that  there  might  exist  some  universal  mechanism  that  works  across  different 
domains. 

Despite  the  wide  variation  in  reciprocity,  the  ratio  between  the  empirical  reci¬ 
procity  and  the  normalized  upper  bound  has  a  much  narrower  range  as  shown  by  the 
box  plots  for  the  ratios  in  Figure  2.12. 

Note  that  the  ratios  are  close  to  zero  for  the  P2P  network  Gnutella  and  software 
call  graphs.  The  Gnutella  exhibits  zero  reciprocity,  far  away  from  the  upper  bounds 
that  are  above  30%.  This  is  probably  because  Gnutella  implements  an  indirect  reci¬ 
procity  mechanism.  The  low  reciprocity  for  software  call  graphs  is  not  surprising,  as 
software  codes  are  usually  designed  to  work  in  a  hierarchical  manner.  The  case  for 
biological  networks  are  more  complicated,  as  the  four  biological  networks  considered 
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Figure  2.12:  Box  plot  of  reciprocity-bound  ratio  for  different  network  categories. 


here  are  actually  of  quite  different  natures.  For  example,  the  C.  Elegan  neural  net¬ 
work  and  the  mouse  cortex  network  are  both  neural  networks,  but  the  former  is  at  the 
neuron  level  while  the  latter  is  at  a  coarser  level  of  cortical  regions.  One  can  speculate 
that  both  the  low  reciprocity  in  C.  Elegan  neural  network  and  the  high  reciprocity  in 
the  mouse  cortex  network  are  due  to  biological  reasons.  However,  we  do  not  know  if 
this  behavior  is  a  norm  or  an  exception  due  to  the  lack  of  data  for  similar  networks. 

In  all  categories  other  than  biological,  software  call  and  P2P  networks,  the  ratios 
are  above  50%  with  only  three  exceptions:  the  wiki- Vote  network,  the  Stack  Overflow 
Q&A  network,  and  the  Spanish  Wikipedia.  Although  we  have  classified  the  Stack 
Overflow  Q&A  network  as  a  social  network,  it  differs  from  typical  social  networks. 
The  low  reciprocity  suggests  that  there  is  a  hierarchy  of  expertise.  What  is  more 
interesting  is  the  wiki- Vote  network  and  the  Spanish  Wikipedia,  as  their  behaviors 
deviate  from  those  of  other  networks  of  the  same  category,  which  suggests  that  there 
might  be  something  unusual  about  them  that  is  worthy  of  scientific  study.  Apart  from 
the  three  outliers,  all  other  networks  in  these  categories  actually  achieve  a  significant 
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fraction  of  the  possible  reciprocity  suggested  by  the  upper  bound.  This  means  that 
modulo  the  degree  constraints,  the  tendency  to  reciprocate  is  much  stronger  than  the 
empirical  reciprocity  alone  might  have  suggested.  Prominent  examples  include  the 
web  graphs,  the  Swedish  Wikipedia  and  the  Google+  network,  whose  reciprocities  are 
not  very  high  in  absolute  value  but  quite  high  relative  to  the  bound.  This  suggests 
that  when  we  study  these  networks,  it  might  be  more  meaningful  to  ask  the  question 
why  there  is  such  large  imbalance  in  degrees  than  to  ask  the  question  why  the  tendency 
to  reciprocate  is  low. 

2.5.3  Reciprocity  of  3-path  Optimal  Digraphs 

In  this  subsection,  we  look  at  3-path  optimal  digraphs  returned  by  Algorithm  1. 
Note  that  the  reciprocity  of  such  a  digraph  provides  a  lower  bound  on  the  maximum 
reciprocity  of  the  corresponding  degree  bi-sequence. 

Figure  2.13  shows  the  scatter  plot  of  the  reciprocities  of  the  3-path  optimal  di¬ 
graphs  against  the  corresponding  upper  bounds.  Figure  2.14  shows  the  box  plots  of 
their  ratios.  Note  that  the  reciprocities  of  3-path  optimal  digraphs  are  close  to  the 
upper  bounds,  especially  for  communication,  co- purchasing,  social  and  Wikipedia 
networks.  This  means  that  the  maximum  reciprocities  are  also  close  to  the  upper 
bounds.  Therefore,  for  the  degree  bi-sequences  of  those  real  networks,  the  fundamen¬ 
tal  limit  that  they  impose  on  reciprocity  is  largely  summarized  by  the  upper  bounds, 
and  the  major  source  of  loss  in  reciprocity  is  the  existence  of  3-paths  of  Types  I,  II 
and  III.  Thus  in  practice  Algorithm  1  usually  suffices  for  approximating  maximum  re¬ 
ciprocities  and  we  do  not  need  to  worry  much  about  the  more  complicated  suboptimal 
structures  in  Section  2. 4. 2. 2. 

Finally,  recall  from  Section  2.4.2. 1  that  the  existence  of  3-cycles  in  a  3-path  optimal 
digraph  requires  some  specific  structures.  These  structures  are  usually  too  special  to 
occur  in  practice,  so  3-cycles  are  unlikely  to  exist  in  3-path  optimal  digraphs.  This 
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Figure  2.13:  Scatter  plot  of  reciprocity  of  the  3-path  optimal  digraph  returned  by 
Algorithm  1  versus  upper  bound. 


Figure  2.14:  Box  plot  of  ratio  between  reciprocity  of  3-path  optimal  digraph  returned 
by  Algorithm  1  and  upper  bound. 


34 


is  indeed  the  case  for  most  of  the  3-path  optimal  digraphs  obtained  from  the  real 
networks  studied  here,  the  anti-symmetric  parts  of  which  turn  out  to  be  acyclic. 

2.6  Conclusion 

In  this  chapter,  we  showed  that  the  maximum  reciprocity  problem  is  NP-hard. 
We  provided  a  partial  characterization  of  networks  with  maximum  reciprocity  and  a 
greedy  algorithm  to  eliminate  suboptimal  motifs.  We  also  provided  an  upper  bound 
on  reciprocity  along  with  necessary  conditions  and  sufficient  conditions  for  achiev¬ 
ing  the  bound.  We  demonstrated  that  the  bound  is  surprisingly  close  to  the  observed 
reciprocity  in  a  wide  range  of  real  networks,  which  suggests  that  the  tendency  to  form 
reciprocal  edges  might  be  much  stronger  than  the  observed  reciprocity  indicates.  We 
found  surprising  linear  relationships  between  empirical  reciprocities  and  the  corre¬ 
sponding  upper  bounds.  We  showed  that  a  particular  type  of  suboptimal  motif  called 
3-paths  is  the  major  source  of  loss  in  reciprocity  in  these  networks. 
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CHAPTER  3 


COMPETITION  UNDER  CUMULATIVE  ADVANTAGE 

3.1  Introduction 

Growth  is  a  fundamental  aspect  inherent  to  most  networks  that  has  been  widely 
investigated  both  empirically  through  the  analysis  of  data  from  various  contexts  and 
theoretically  through  idealized  models.  An  important  driving  force  behind  network 
growth  and  in  particular  the  evolution  of  node  degrees  is  cumulative  advantage  (CA), 
where  accumulated  edges  (i.e. ,  current  node  degree)  promote  gathering  even  more 
edges;  see  [62]  and  the  reference  therein.  A  second  widely-accepted  driving  force  in 
this  context  is  fitness,  which  captures  the  inherent  ability  of  nodes  to  attract  edges. 
Thus,  dynamics  of  network  growth  is  governed  by  skill  (fitness)  and  luck  (random 
but  biased  edge  attachment). 

Recent  work  has  framed  the  problem  of  network  growth  as  a  competition  among 
nodes  [50,  72,  63].  In  essence,  nodes  in  a  network  compete  with  one  another  to 
accumulate  edges,  increasing  (or  descreasing)  their  degrees  over  time.  As  expected, 
such  competitions  are  also  driven  by  skill  and  luck  and  have  been  studied  empirically 
and  theoretically  for  different  networks,  an  example  of  which  is  the  evolution  and 
predictability  of  success  in  citation  networks  [72],  Outside  the  domain  of  networks, 
the  study  of  skill  and  luck  competitions  has  a  long  history  in  social  and  physical 
sciences  [7,  22], 

However,  the  intricacies  of  skill  and  luck  competitions  are  far  from  trivial,  even 
in  a  simple  CA  model  with  just  two  competitors.  To  illustrate,  consider  a  network 
with  two  hub  nodes  that  compete  for  connectivity.  Each  time  a  new  node  joins  the 
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network,  it  connects  to  one  of  the  hubs  randomly  with  bias  depending  on  the  hubs’ 
fitnesses  (model  details  given  in  Section  3.3).  In  the  presence  of  CA,  the  bias  also 
depends  on  the  hubs’  current  degrees.  Figure  3.1(a)  and  Figure  3.1(b)  illustrate  the 
difference  of  the  hubs’  degrees  over  time  for  two  sample  paths  in  the  absence  and 
presence  of  CA,  respectively.  The  paths  with  the  same  color  and  label  in  both  plots 
are  generated  using  the  same  pseudorandom  sequence.  The  competition  is  tied  every 
time  the  degree  difference  is  zero  and  we  define  the  competition  duration  as  the 
time  until  the  final  tie  occurs.  Note  that  the  red  path  in  Figure  3.1(b)  lasts  much 
shorter  than  the  red  path  in  Figure  3.1(a),  while  the  blue  path  in  Figure  3.1(b)  lasts 
significantly  longer  than  the  blue  path  in  Figure  3.1(a).  This  suggests  a  potentially 
larger  variance  in  competition  duration  in  the  presence  of  CA.  Moreover,  with  CA 
the  less  fit  hub  may  enjoy  a  sizable  degree  leadership  for  a  long  time;  see  the  blue 
path  in  Figure  3.1(b).  These  observations  also  apply  to  two  specific  nodes  in  more 
general  network  growth  models,  provided  that  we  interpret  “time”  as  the  increase  in 
the  total  degrees  of  the  two  given  nodes.  However,  are  these  sample  paths  anomalies 
or  the  norm?  Can  we  be  more  precise  about  these  observations? 

In  this  chapter,  we  aim  to  develop  a  fundamental  understanding  of  the  effects  of 
CA  in  such  growth  competitions.  We  approach  this  problem  by  considering  classical, 
simple  and  well-studied  theoretical  models  for  competitions  based  on  skill  and  luck 
that  are  either  coupled  with  or  free  of  cumulative  advantage.  These  models  may  not 
be  general  enough  as  statistical  models  that  fit  real-world  data  for  competitions  in 
growing  networks,  as  such  models  must  capture  intricate  features  of  the  domain,  such 
as  skill  distribution  or  amplitude  of  cumulative  advantage  (e.g.,  linear  or  sub-linear), 
as  well  as  their  time  dependency.  However,  they  still  provide  invaluable  insights  into 
how  CA  impacts  competitions.  More  specifically,  we  focus  on  competitions  between 
two  agents  (nodes)  and  study  two  fundamental  aspects  of  competitions:  duration 
-  the  time  required  for  the  most  skilled  to  overtake  its  competitor  and  forever  en- 
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(a)  Competition  without  CA. 


(b)  Competition  with  CA. 


Figure  3.1:  Time  evolution  of  degree  difference  between  two  nodes  in  competitions 
without  and  with  cumulative  advantage  (CA).  Each  plot  shows  two  independently 
simulated  sample  paths.  The  sample  paths  with  the  same  color  in  both  plots  use  the 
same  sequence  of  pseudorandom  numbers.  Competition  starts  tied  and  node  X  is 
10%  fitter  than  node  Y.  See  model  details  in  Section  3.3. 


joy  undisputed  leadership;  intensity  -  the  number  of  times  competitors  tie  for  the 
leadership.  In  this  direction,  we  make  the  following  main  contributions. 

•  In  the  case  where  the  two  competitors  have  equal  fitness,  we  obtain  the  asymp¬ 
totic  tail  distributions  for  both  duration  and  intensity  of  CA  competitions.  We 
demonstrate  that  they  are  power  laws  with  respective  tail  exponents  —1/2  and 
—  1,  which  are  independent  of  the  initial  wealth  of  the  competitors. 

•  In  the  case  where  the  two  competitors  have  unequal  fitness,  we  derive  asymptotic 
lower  and  upper  bounds  for  the  tail  distribution  of  duration  of  CA  competitions, 
and  an  upper  bound  for  the  tail  distribution  of  their  intensity.  These  bounds 
show  that  duration  is  heavy  tailed  while  intensity  is  exponential  tailed  in  the 
presence  of  CA.  In  particular,  duration  is  heavier  tailed  while  intensity  is  lighter 
tailed  than  corresponding  RW  competitions. 

•  We  observe  that  a  slight  difference  in  fitness  of  the  two  results  in  a  extremely 
heavy  tail  for  duration  of  CA  competitions.  Thus,  an  individual  that  is  only 
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slightly  more  skilled  than  his  competitor  might  have  to  hang  on  to  the  compe¬ 
tition  for  an  extremely  long  period  of  time  before  taking  the  ultimate  lead,  a 
phenomenon  we  call  the  “struggle  of  the  fittest” . 

Despite  90  years  since  the  basic  CA  model  was  first  proposed  [25],  known  as 
Polya’s  urn  model,  the  work  in  this  chapter  is,  to  the  best  of  our  knowledge,  the 
first  to  characterize  the  duration  and  intensity  distributions  of  CA  competitions  with 
skill.  We  believe  our  findings  have  profound  implications  to  our  understanding  of 
competitions,  beyond  its  importance  to  the  evolution  of  degree  of  nodes  in  growing 
networks. 

The  rest  of  this  chapter  is  organized  as  follows.  Section  3.2  briefly  discusses  the 
related  work.  Section  3.3  introduces  the  CA  competition  model.  Section  3.4  presents 
the  theoretical  results,  illustrated  and  supplemented  by  simulations.  Section  3.5  con¬ 
cludes  this  chapter. 

3.2  Related  Work 

Resource  accumulation  is  an  ubiquitous  phenomenon  that  naturally  arises  in  a 
variety  of  social  and  complex  systems.  The  problem  is  usually  framed  as  a  competition 
among  agents  for  resources  that  are  abundant,  and  has  been  studied  in  different 
contexts  across  various  disciplines  ranging  from  protein  binding  within  a  cell  [26,  46] 
to  views  of  online  social  media  [14,  29]  and  citations  among  scholarly  papers  [65,  72], 
In  the  context  of  networks,  network  growth  and  in  particular  node  degree  evolution 
has  been  recently  framed  as  competition  among  nodes,  where  different  aspects  of 
competitions  have  been  studied  empirically  and  theoretically  [72,  63,  33,  50,  13]. 

Models  for  resource  accumulation  competitions  generally  incorporate  skill  (fit¬ 
ness),  luck  (randomness)  and  externalities.  Cumulative  advantage  is  one  type  of 
externality  that  is  considered  a  general  mechanism  for  inequality  [22],  It  appears  in 
the  literature  under  many  variants  such  as  Price’s  cumulative  advantage  model  [65], 


39 


preferential  attachment  [9,  10,  12],  “the  rich  get  richer”,  Matthew  effect  [22,  55,  62], 
and  path-dependent  increasing  returns  [7].  The  Polya’s  urn  model  [25,  54]  is  widely 
used  to  capture  these  effects.  Most  previous  work  on  Polya’s  urn  model  and  its  gen¬ 
eralizations  focuses  on  the  share  of  resources  gathered  by  each  agent,  also  known 
as  the  agent’s  market  share,  proving  convergence  and  limiting  results  of  the  market 
share  distribution  [54,  43,  61].  More  recent  studies  consider  Polya’s  urn  models  with 
non-linear  bias  [23],  the  effects  of  initial  conditions  of  urns  [59,  15],  as  well  as  the  time 
for  the  first  tie  [6]  and  probability  of  a  tie  ever  occuring  [71]. 

However,  two  fundamental  metrics  associated  with  competitions,  duration  -  how 
long  it  takes  for  the  undisputed  winner  to  emerge,  and  intensity  -  how  many  times 
the  competitors  tie  for  the  leadership,  have  largely  been  neglected  in  the  literature. 
Previous  results  establish  that  the  most  skilled  agent  eventually  wins  [54],  and  that 
average  intensity  up  to  time  t  is  approximately  (logf)a,  where  a  depends  on  the 
relative  skill  of  the  competitors  [34,  33].  To  the  best  of  our  knowledge,  no  previous 
work  has  provided  rigorous  characterizations  for  the  distributions  of  duration  and 
intensity  of  competitions  in  Polya’s  urn  models.  This  chapter  partially  fills  this  gap  for 
the  two  competitor  case  and  sheds  light  on  some  recent  approximate  results  [34,  33]. 

3.3  Models 

In  this  section,  we  formally  introduce  competition  models  for  two  competing 
agents  and  give  precise  definitions  for  two  fundamental  metrics  of  a  competition, 
i.e.  its  duration  and  intensity. 

3.3.1  General  Setup  and  Metrics 

Let  X  and  Y  denote  the  two  agents  that  engage  in  the  competition.  Each  agent 
is  associated  with  a  positive  fitness  value  that  reflects  its  intrinsic  competitiveness  or 
skill  level.  Let  fx  and  fy  denote  the  fitness  of  A"  and  Y,  respectively,  and  r  =  fx/fy 
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Figure  3.2:  State  space  of  competition  processes,  with  an  illustration  of  a  sample 
path  with  xo  —  2,  y0  —  1,  and  three  ties  at  time  t  —  3,  5,  7. 

the  fitness  ratio.  Without  loss  of  generality,  we  assume  that  fx  >  fy  and  hence 
r  >  1. 

The  resource  that  the  agents  compete  for  will  be  generically  referred  to  as  wealth, 
which  is  measured  in  discrete  units.  The  competition  starts  at  time  t  —  0  with 
agents  X  and  Y  having  xq  and  yo  units  of  initial  wealth,  respectively.  We  consider  a 
discrete-time  process.  At  each  time  step,  one  unit  of  wealth  is  added  to  the  system 
and  given  to  either  X  or  Y.  Denote  by  Xt  and  Yt  the  respective  cumulative  wealth 
of  X  and  Y  at  time  t.  The  complete  history  of  the  competition  {(Ah,17)}^o  then 
forms  a  discrete-time  discrete-space  stochastic  process.  The  state  space  S  is  the  first 
quadrant  of  the  integral  lattice  (see  Figure  3.2), 

S  =  {(: x,y )  G  Z2  :  x  >  l,y  >  1}. 

The  initial  condition  is  (X0,Y0)  =  (x0,y0).  How  the  process  evolves  over  time  is 
determined  by  specific  competition  models,  of  which  the  CA  competition  model  to 
be  introduced  in  Section  3.3.2  is  an  example. 

We  now  make  the  notions  of  duration  and  intensity  of  competitions  more  pre¬ 
cise  by  defining  them  through  events  of  wealth  ties.  Given  a  competition  process 


41 


{(Xt,Yt)}^ 0,  we  say  that  a  tie  occurs  at  time  t  if  Xt  =  Yt.  Figure  3.2  shows  three 
ties  at  times  t  =  3,5,  7. 

The  duration  T  of  a  competition  is  defined  to  be  the  time  of  the  last  tie,  i.e., 

T  =  sup{f  >  0  :  Xt  =  Yt  }. 

When  there  is  no  tie,  we  follow  the  standard  convention  that  T  =  sup0  =  —  oc.  The 
competition  ends  at  time  T  in  the  sense  that  one  of  the  agents  takes  the  lead  and 
never  lose  it  again  after  T. 

The  intensity  Nt  of  a  competition  until  time  t  is  the  number  of  ties  that  occur  by 
time  t,  i.e., 

t 

Nt  =  lxi=Yn 
i= 0 

where  1  a  is  the  indicator  of  event  A.  The  intensity  N  of  a  competition  is  the  total 
number  of  ties  throughout  the  competition,  i.e.,  N  =  lin This  measures 
the  intensity  of  the  competition  in  the  sense  that  it  counts  the  number  of  potential 
changes  in  leadership.  Note  that  T  <  +oo  if  and  only  if  N  <  +oo. 

3.3.2  CA  Competition  Model 

In  the  CA  competition  model,  the  unit  of  wealth  introduced  at  time  t  + 1  is  given 
to  X  with  probability 


fxXt  _  rXt 
PX fxXt  +  fvYt  rXt  +  Yt ’ 

otherwise  it  is  given  to  Y.  Note  that  the  transition  probability  px.t  embodies  both 
fitness  and  CA  effects  (externalities). 
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More  formally,  in  the  CA  competition  model,  the  complete  history  {(AQ,  y])}[A0 
forms  a  discrete-time  Markov  chain  with  stationary  transition  probabilities.  The 
transition  probability  P[(W+i,  Yt+±)  —  {x',y')  |  ( Xt,Yt )  =  (x,y)\  is  given  by 


QcAAxiy,x'iy') 


rx 

if 

(x',y')  = 

rx  +  y' 

y 

if 

(■ x',y ')  = 

rx  +  y' 

o, 

otherwise. 

ix  +  i,y), 
(x,y  + 1), 


(3.1) 


Note  that  the  transition  probabilities  are  spatially  inhomogeneous,  i.e.,  they  depend 
on  the  current  state  ( x,y ),  which  makes  the  analysis  difficult,  especially  when  r  >  1. 

For  the  purpose  of  comparison,  a  RW  competition  model  incorporates  skill  and 
luck  but  not  the  CA  effect  (no  externalities),  where  the  transition  probabilities  are 
determined  entirely  by  the  fitness  ratio  r.  In  particular,  the  probability  that  agent  X 
receives  the  unit  of  wealth  introduced  at  any  time  is  always  given  by 


fx  =  r 
fx  +  fv  r  +  1  ‘ 


Thus  the  RW  competition  model  is  a  discrete-time  Markov  chain  with  the  same  state 
space  S  as  the  CA  competition  model,  but  with  the  following  spatially  homogeneous 
transition  probabilities, 


(  r 


V)  %  7  y )  \ 


r  +  1  ’ 
1 

r  +  1  ’ 
0, 


if  {x',y')  =  (x  +  1  ,y), 
if  (x',y')  =  (x,y  +  1), 
otherwise. 


The  spatial  homogeneity  of  the  transition  probabilities  leads  to  a  more  tractable 
analysis.  In  fact,  the  difference  process  {Xt  —  Yt}  is  a  standard  biased  RW  with 
parameter  r/(r  +  1).  Thus  the  abundance  of  known  results  for  RW  [38]  can  be 
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directly  translated  into  results  for  RW  competitions,  including  duration  and  intensity 
as  we  have  defined  in  Section  3.3.1. 

Throughout  the  rest  of  this  chapter,  we  use  CA=  and  RW=  to  denote  CA  and 
RW  competitions  with  identical  fitness  (r  =  1),  respectively.  We  use  CA^  and  RW^ 
to  denote  CA  and  RW  competitions  with  distinct  fitnesses  (r  >  1),  respectively. 
Before  presenting  our  results,  we  point  out  some  connections  between  the  CA  and 
RW  models  that  are  useful  in  our  analysis.  In  particular,  in  CA=,  all  paths  connecting 
two  given  states  (.x'o,  yo)  and  (x,y)  have  the  same  probability.  This  is  a  nice  property 
that  CA=  shares  with  RW,  which  enables  us  to  leverage  existing  results  on  RW  in 
our  analysis  of  CA=.  Unfortunately,  this  property  is  lost  in  CA^,  where  we  resort  to 
the  Chapman-Kolmogorov  equation  for  upper  and  lower  bounds  on  the  probabilities 
of  interest.  In  the  limiting  case  where  Xt  and  Yt  are  both  large  but  comparable  to 
each  other,  the  connection  to  RW  is  again  partially  retained,  a  fact  we  also  exploit  in 
the  analysis  of  CA^. 

3.4  Results 

In  this  section  we  present  our  theoretical  results  for  duration  and  intensity  distri¬ 
butions,  which  are  also  illustrated  graphically  and  supported  by  extensive  numerical 
simulations.  Table  3.1  provides  a  summary  of  our  main  results  along  with  prior 
knowledge  about  RW  competitions  from  the  literature.  Note  that  P(model)  r  denotes 
the  probability  in  model  (Model)  e  {CA,  RW}  with  fitness  ratio  r  and  initial  state 
(To,  Vo)-  The  following  notations  have  been  used  in  Table  3.1  and  will  be  used  through¬ 
out  the  rest  of  this  chapter. 

•  f(x)  ~  fj(x)  if  and  only  if  Hindoo  f(x)/g(x)  =  1. 

•  f(x)  ^  a(x)  T  and  only  if  limsup^^  f(x)/g(x)  <  1. 

•  f(x)  ^  9{x)  if  and  only  if  liming.**  f{x)/g(x)  >  1. 
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metric 

(Model) 

r  —  1 

r  >  1 

duration  T  : 

jpCOjJ/Cl)  \rp  v.  j] 

r  (Model)  ,rld  —  '4 

CA 

~  t-P2 

<  f-(r- l)®o 

>  o-A 

RW 

1 

< 

to 

1 _ 1 

t 

intensity  N  : 

> «] 

CA 

~  n-1 

<  1 

<  2  y»- 

k  r+l  / 

-1 

RW 

1 

Table  3.1:  Tail  distributions  for  duration  and  intensity  of  competitions  in  both  RW 
and  CA  models.  Multiplicative  constants  are  omitted  in  all  expressions  involving  t 
and  n.  The  RW  statistics  can  be  found  in  most  textbooks  on  the  topic,  e.g.  [38,  pp. 
113,116], 


All  proofs  are  relegated  to  Appendix  C. 

3.4.1  Competition  Duration 

As  shown  in  Table  3.1,  RW=  competitions  never  end,  i.e.,  [T  =  oo]  =  1, 

while  RW^  competitions  are  generally  very  short,  whose  durations  exhibit  exponen¬ 
tial  tails.  The  story  for  CA  competitions  is  drastically  different.  The  introduction 
of  CA  guarantees  that  a  competition  always  ends,  i.e.  <  oo]  =  1,  even 

when  the  two  agents  are  equally  fit,  which  is  in  sharp  contrast  to  endless  RW=  com¬ 
petitions.  On  the  other  hand,  CA  fundamentally  increases  the  chance  of  having  a 
long-lasting  competition  between  unequally  fit  agents,  as  the  duration  of  CA^  always 
has  a  power- law  distribution,  in  contrast  to  a  sub-exponential  distribution  for  RW^. 
Thus,  cumulative  advantage  does  not  always  make  competitions  shorter  as  one  might 
expect. 

3. 4. 1.1  Equal  Fitness  Case:  CA= 

The  following  theorem  shows  that  the  duration  T  for  CA=  is  heavy-tailed  with  an 
asymptotic  power-law  distribution. 
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Theorem  3.4.1.  The  duration  of  a  CA=  competition  has  the  following  asymptotic 
tail  distribution, 


p(^o>?/o) 

rCA,l 


[T>t] 


r 


_ _ _ t~ 1/2 

2x°+y°-5/2y/nB(Xo,  y0) 


where  B(x,y )  =  f*  sx  7(1  —  s)y  xds  is  the  beta  function. 
It  follows  from  (3.2)  that 


(3.2) 


F’ca”)P’  <  oo]  =  1  -  lim  PjS'f’lT  >  t\  =  1, 

£—>•00  ’ 

i.e.  the  duration  of  CA=  is  almost  surely  finite. 

Note,  however,  that  the  power- law  exponent  is  always  —1/2,  independent  of  the 
initial  wealth  xq  and  y0.  Consequently,  although  the  duration  of  CA=  is  finite  rather 
than  infinite  as  in  RW=,  the  expected  duration  is  still  infinite,  even  if  xq  is  significantly 
larger  than  yo  or  vice  versa. 

On  the  other  hand,  the  initial  wealth  (xq,  yo)  does  affect  the  multiplicative  factor  in 
(3.2).  Figure  3.3  shows  the  duration  distributions  from  simulations  for  various  values 
of  initial  wealth,  with  the  asymptotes  in  Eq.  (3.2)  superimposed.  Each  simulation 
curve  is  the  average  of  105  independent  runs  for  107  time  steps  each.  All  curves  are 
truncated  at  t  =  106,  since  the  empirical  distributions  will  drop  down  sharply  and 
become  inaccurate  as  t  approaches  the  cutoff  time  in  simulations.  Similar  truncations 
will  be  applied  to  later  plots  without  further  mention.  Note  the  good  agreement 
between  theory  and  simulation  in  the  tails  in  Figure  3.3.  When  both  xq  and  yo 
increase  but  are  kept  equal,  the  distribution  curve  shifts  upwards,  which  means  the 
competition  lasts  longer.  When  the  initial  wealth  of  only  one  agent  (yo  here)  increases, 
the  distribution  curve  shifts  downwards,  which  means  the  competition  is  shorter. 
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Figure  3.3:  Tail  distribution  for  duration  of  CA=  with  various  (xo,yo)-  The  dots  are 
simulation  results.  The  solid  lines  are  the  asymptotes  in  Eq.  (3.2). 


3.4. 1.2  Different  Fitness  Case:  CA^ 

The  next  theorem  shows  that  the  tail  distribution  of  the  duration  T  for  CA^  is 
asymptotically  bounded  by  power  laws  from  both  above  and  below. 

Theorem  3.4.2.  The  tail  distribution  of  the  duration  of  a  CA ^  competition  has  the 
following  asymptotic  bounds, 

<P!  r(r“1)ai0  <  P^o)[T>t]  <  ip2t~{r~1)(x°~1/r\  (3.3) 


where 

_ _ r(ra0  +  y0) _ 

V?1  (r  +  l)a:o2xo+OT'1r(a:o)r(|/o)  ’ 

and 

_  2^-1^x°-r~^r(r-1)r(rx0  +  y0) 
^  (r  +  1)(xq  —  r_1)T(a:o)r(?/0) 

where  r(x)  =  J0°°  sx~1e~sds  is  the  gamma  function. 


(3.4) 


(3.5) 


It  follows  from  the  lower  bound  that  P <  oo]  =  1  for  r  >  1,  i.e.  the 
duration  for  CA^  is  almost  surely  finite  as  is  for  CA=.  The  constants  <pi  and  <p2  are 
very  loose,  so  the  bounds  are  best  interpreted  as  bounds  on  the  tail  exponent. 
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Figure  3.4:  Tail  distribution  for  duration  of  CA^  with  r  =  1.2  and  various  (x0,y0). 
The  dots  are  simulation  results.  The  solid  lines  are  the  asymptotic  lower  bound  in 
Eq.  (3.3)  but  shifted  closer  to  the  simulation  results  for  easier  visual  comparison  of 
the  slopes. 


Note  that  the  power-law  exponents  in  the  upper  and  lower  bounds  depend  on 
x0  but  not  on  y0,  and  they  differ  only  by  1  —  1/r  <  1.  In  this  sense,  the  shape  of 
the  distribution  at  large  t  is  largely  determined  by  the  fitness  ratio  and  the  initial 
wealth  of  the  fitter  agent,  while  the  initial  wealth  of  the  less  fit  plays  a  much  weaker 
role.  This  is  illustrated  in  Figure  3.4,  which  shows  the  duration  distributions  from 
simulations  for  r  =  1.2  and  various  values  of  (xo,  !jo),  alongside  the  lower  bounds  from 
Eq.  (3.3)  that  are  shifted  closer  to  the  simulation  results  for  easier  comparison  of  the 
slopes.  Each  simulation  curve  is  the  average  of  105  independent  runs  for  109  time 
steps  each. 

Figure  3.4(a)  shows  how  the  slopes  of  the  distribution  curves,  which  correspond 
to  the  power-law  exponents,  depend  critically  on  xq.  The  impact  of  xq  is  two-fold.  As 
x0  increases,  the  distribution  curve  becomes  more  tilted  as  predicted  by  the  bounds. 
At  the  same  time,  it  also  shifts  downwards.  Both  changes  mean  that  the  competition 
tends  to  be  shorter. 
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Figure  3.4(b)  shows  the  impact  of  changing  both  xQ  and  y0.  When  x$  is  hxed, 
increasing  y0  only  results  in  a  slight  decrease  in  the  absolute  value  of  the  slope,  in 
agreement  with  Eq.  (3.3).  The  distribution  curve  shifts  upwards,  which  means  the 
competition  tends  to  last  longer.  When  both  xq  and  yo  increase,  the  situation  becomes 
more  intricate.  The  curve  may  shift  upwards  while  bending  down  faster  in  the  tail, 
which  could  possibly  lead  to  a  crossover  in  the  old  and  new  curves,  as  is  the  case  of 
going  from  (x0,y0)  =  (1,1)  to  (x0,y0)  =  (3,3).  In  this  case,  the  new  competition  is 
more  likely  to  have  a  medium  duration. 

3.4. 1.3  Struggle-of-the-Fittest  Phenomenon 

Now  we  look  at  the  impact  of  fitness  ratio  r  on  duration.  Contrasting  Eqs.  (3.2) 
and  (3.3)  leads  to  an  interesting  observation.  Departing  from  CA=  by  slightly  increas¬ 
ing  the  fitness  ratio  r  from  1  to  1  +  s,  where  £  is  close  to  0,  precipitates  a  significant 
increase  in  the  probability  of  long-lasting  competitions,  as  manifested  in  the  discon¬ 
tinuous  jump  in  the  power-law  exponents  from  —1/2  in  Eq.  (3.2)  to  —£Xq  ~  0  in 
Eq.  (3.3).  This  is  opposite  to  what  happens  in  RW  competitions,  where  a  slight  in¬ 
crease  in  fitness  departing  from  RW=  to  RW^  transforms  the  competition  from  one 
that  never  ends  to  one  with  a  geometrically  distributed  duration.  The  lower  bound 
in  Eq.  (3.3)  shows  that  CA^  with  r  <  1  +  (2xo)_1  is  more  likely  to  have  long-lasting 
competitions  than  CA=,  despite  the  fact  that  the  fitter  agent  is  bound  to  become  the 
ultimate  winner.  We  refer  to  the  phenomenon  that  the  fitter  agent  takes  an  extremely 
long  time  to  win  as  “struggle  of  the  fittest” . 

Figure  3.5  shows  the  duration  of  simulated  CA  competitions  for  various  fitness 
ratios  r.  Each  simulation  curve  is  the  average  of  105  independent  runs  for  109  time 
steps  each.  Note  how  the  distribution  of  duration  jumps  upward  from  the  curve  for 
CA=  to  the  curve  for  CA^  with  r  =  1.1.  It  also  shows  how  the  curves  for  CA^ 


49 


Figure  3.5:  Tail  distribution  for  duration  of  CA  with  various  r.  The  dots  are  simu¬ 
lation  results.  For  r  —  1,  the  solid  line  is  the  asymptote  in  Eq.  (3.2).  For  r  >  1,  the 
solid  lines  are  the  lower  bound  in  Eq.  (3.3)  but  shifted  as  in  Figure  3.4. 


become  more  and  more  tilted  as  r  increases,  being  roughly  parallel  to  the  CA=  curve 
at  r  =  1  +  (2x0)_1  =  1.5. 


3.4.2  Competition  Intensity 

Given  that  CA  competitions  are  long-lasting,  one  might  expect  them  also  to  be 
intense,  i.e. ,  exhibit  many  ties  (Xt  =  Yt).  As  we  will  see  in  this  section,  this  intuition 
is  appropriate  for  CA=  but  not  for  CA^. 


3.4. 2.1  Equal  Fitness  Case:  CA= 

The  following  theorem  shows  that  the  intensity  N  of  CA=  is  heavy-tailed  with  an 
asymptotic  power-law  distribution. 


Theorem  3.4.3.  The  intensity  of  a  CA=  competition  has  the  following  asymptotic 
tail  distribution. 


PwA  >  n] 


-1 


2x0+y0-2  B(x0,y0) 
where  B(x0,y0)  is  the  beta  function  as  in  Eq.  (3.2). 


n 


(3.6) 


In  this  case,  the  intensity  has  infinite  expectation,  as  does  the  duration.  Figure  3.6 
shows  the  duration  distributions  from  simulations  for  various  values  of  initial  wealth, 
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Figure  3.6:  Tail  distribution  for  intensity  of  CA=  with  various  (xo,yo).  The  dots  are 
simulation  results.  The  solid  lines  are  the  asymptotes  from  Eq.  (3.6). 

with  the  asymptotes  in  Eq.  (3.6)  superimposed.  Each  simulation  curve  is  the  average 
of  105  independent  runs  for  10'  time  steps  each.  We  observe  the  same  behavior  as 
in  Figure  3.3.  When  both  xq  and  yo  increase  but  are  kept  equal,  the  distribution 
curve  shifts  upwards,  which  means  the  competition  is  more  intense.  When  the  initial 
wealth  of  only  one  agent  ( yo  here)  increases,  the  distribution  curve  shifts  downwards, 
which  means  the  competition  is  less  intense. 

We  mention  in  passing  that  if  we  have  a  finite  observation  time  tf ,  the  expected 
intensity  Ntf  by  time  tf  grows  as  log  tf,  a  phenomenon  observed  for  the  related  CA 
model  in  Godreche  et  al.  [33]. 

3. 4. 2. 2  Different  Fitness  Case:  CA^ 

In  sharp  contrast,  CA^  competitions  are  not  intense  despite  their  long  durations. 
In  fact  their  intensities  are  surprisingly  mild,  bounded  above  by  a  geometric  distri¬ 
bution,  as  shown  in  the  next  theorem. 

Theorem  3.4.4.  The  tail  distribution  of  the  intensity  of  a  CA ^  competition  has  the 
following  upper  bound, 
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Figure  3.7:  Tail  distribution  for  intensity  of  CA  with  various  r.  The  dots  are  simula¬ 
tion  results.  The  solid  lines  are  the  upper  bounds  from  Eq.  (3.7)  for  r  >  1,  and  the 
asymptote  in  Eq.  (3.6)  for  r  =  1. 


ra>(a:o,j/o) 

rCA,r 


[N  >  n\  <  C 


71—1 


1 


(3.7) 


with 


C  = 


1,  <  Vo, 


where  (x)k  =  nf=o  the  Pochhammer  symbol. 


Note  that  the  expectation  and  all  higher  moments  of  N  are  finite.  Therefore, 
the  intensity  of  a  CA  competition  changes  dramatically  when  the  fitnesses  of  the 
two  parties  become  unequal,  the  distribution  shifting  from  a  power-law  tail  to  an 
exponential  tail.  This  is  illustrated  in  Figure  3.7,  where  each  simulation  curve  is  the 
average  of  105  independent  runs  for  109  time  steps  each.  An  important  observation 
is  that  both  CA^  and  RW^  competitions  have  intensities  that  are  upper  bounded  by 
identical  exponential  tails  (see  Table  3.1),  while  exhibiting  fundamentally  different 
durations. 
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Why  are  CA^  competitions  simultaneously  not  intense  and  long-lasting?  The  an¬ 
swer  resides  in  the  probability  of  Y  being  the  eventual  winner.  In  CA=  competitions, 
Y  wins  with  probability  ?/o/(^o  +  2/o),  while  in  CA^  competitions  Y  (the  less  fit)  never 
wins.  However,  for  small  values  of  r,  especially  for  those  very  close  to  one,  the  dy¬ 
namics  in  the  initial  stages  of  the  competition  closely  follows  that  of  CA=.  Thus  there 
is  a  non-negligible  chance  that  Y  takes  a  significant  lead,  with  the  CA  effect  helping 
it  uphold  the  lead  for  a  long  period  of  time  over  which  there  is  no  tie.  Eventually, 
however,  the  fitness  effect  outweighs  the  CA  effect,  and  X  catches  up  with  Y.  By 
then  they  both  have  large  accumulated  wealth,  which  makes  CA^  behave  like  RW^ 
in  the  vicinity  of  A"  =  Y,  allowing  A"  to  quickly  establish  a  lead  ahead  of  Y.  At 
this  final  stage  both  fitness  and  CA  effects  work  in  favor  of  X,  and  Y  stands  little 
chance  in  taking  the  lead  again.  To  summarize,  the  less  fit  agent  has  a  non-negligible 
probability  of  taking  an  early  lead  which  can  last  for  a  very  long  time  due  to  the 
CA  effect,  but  it  will  ultimately  surrender  the  lead  to  the  fitter  agent  and  never  lead 
again,  a  phenomenon  that  we  call  “delusion  of  the  weakest” ,  which  is  the  flip-side  of 
“struggle  of  the  fittest” . 

Figure  3.8  illustrates  this  observation  by  showing  sample  paths  for  different  values 
of  r,  all  generated  using  the  same  sequence  of  random  numbers.  Note  that  for  r  —  1 
(identical  fitness),  Y  wins  quickly,  whereas  for  r  =  1.1  the  fitter  agent  X,  having 
trailed  behind  for  a  long  time,  eventually  takes  over  after  69,426  time  steps.  Finally, 
for  both  r  =  1.2  and  r  =  1.5  agent  X  has  no  trouble  quickly  winning  the  competi¬ 
tion.  These  sample  paths  showcase  the  long  struggle  of  the  “slightly”  fitter  agent  in 
competitions  with  CA  effects. 

3.4.3  Interplay  of  Duration  and  Intensity 

In  this  section,  we  study  the  relationship  between  duration  and  intensity.  Note 
that  duration  gives  a  natural  upper  bound  N  <  T/2  for  intensity,  i.e.,  the  number 
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Figure  3.8:  “Delusion  of  the  weakest”:  sample  paths  for  different  values  of  r  (ccq  = 
2/o  =  l),  all  generated  using  the  same  sequence  of  random  bits. 

of  ties  is  at  most  half  of  the  duration  in  any  competition.  In  CA=,  duration  and 
intensity  are  strongly  and  positively  correlated.  In  fact,  a  tie  at  time  t  increases  the 
probability  of  having  another  tie  at  a  time  greater  than  t.  More  precisely,  [6]  shows 
that  for  CA=, 

pSm’P’  >  t\X,  =Y,]~1-  -jL=.  (3.8) 

Since  Xt  ~  t/ 2  at  a  tie,  Eq.  (3.8)  implies  that  the  later  a  tie  occurs,  the  more  likely 
another  tie  will  occur,  intuitively  explaining  why  long-lasting  competitions  are  also 
intense  in  this  case. 

Figure  3.9  shows  a  scatter-plot  of  duration  versus  intensity  from  104  independent 
runs  of  CA=  competitions  with  xq  =  yo  =  1,  each  simulated  for  109  time  steps.  This 
unveils  a  strong  positive  linear  correlation  between  the  two  statistics  in  log-log  scale 
(sample  Pearson  correlation  coefficient  of  0.94). 

Interestingly,  CA^  shows  a  different  behavior,  since  even  long-lasting  competi¬ 
tions  exhibit  only  a  small  number  of  ties.  Figure  3.10  shows  simulation  results  for 
conditional  average  intensities  of  competitions  with  x0  =  ?/0  =  1  and  different  fitness 
ratios  r,  conditioned  on  the  duration  being  at  least  t.  Each  simulation  curve  is  ob¬ 
tained  from  104  independent  runs  for  109  time  steps  each.  Note  that  for  r  =  1,  the 
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Figure  3.9:  Scatter  plot  of  duration  vs.  intensity  for  CA=. 


Figure  3.10:  Conditional  average  intensity  of  competitions  conditioned  on  their  du¬ 
ration  being  at  least  t,  namely  E^f ^°\n  \  T  >t]. 


conditional  average  intensity  increases  linearly  with  t,  but  for  r  >  1,  it  stabilizes  as  t 
increases.  Again,  we  observe  a  sharp  transition  as  we  move  from  identical  to  distinct 
fitnesses,  this  time  in  the  correlation  between  intensity  and  duration. 


3.5  Discussion  and  Conclusion 

As  various  empirical  studies  [67,  72,  69,  22,  63,  62]  suggest  that  real  world  com¬ 
petitions  for  resource  accumulation  are  subject  to  cumulative  advantage  effects,  at 
least  to  some  extent,  a  theoretical  understanding  of  the  role  of  skill  and  luck  in  com- 
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petition  dynamics  becomes  a  pressing  issue.  Recent  theoretical  studies  [21,  6,  50,  33] 
have  contributed  in  this  direction. 

However,  contrary  to  prior  theoretical  works,  we  considered  simple  and  classical 
mathematical  models  that  capture  just  the  essence  of  skill  and  luck  competitions  with 
and  without  CA  effects,  and  investigated  fundamental  aspects  of  competition,  namely 
duration  (i.e.,  time  until  ultimate  winner  emerges)  and  intensity  (i.e. ,  number  of  ties 
in  competition).  By  considering  simple  models  and  simple  properties  we  proved  and 
illustrated  fundamental  theoretical  results:  CA  effect  exacerbates  the  role  of  luck  - 
power-law  tail  duration  emerges  regardless  of  skill  differences,  and  become  extreme 
(i.e.,  infinite  mean)  when  skill  differences  are  small  enough.  Moreover,  duration  is 
long  not  necessarily  because  of  intense  competition  where  agents  tussle  aggressively 
for  ultimate  leadership.  On  the  contrary,  under  CA,  competitions  are  generally  very 
mild,  exhibiting  an  exponential  tail.  Long  competitions  emerge  when  an  early  stroke 
of  luck  places  the  less  skilled  in  the  lead,  who  can  then,  boosted  by  CA  effects,  enjoy 
leadership  for  a  very  long  period  of  time.  Thus,  when  CA  is  present  luck  sides  with 
the  less  skilled. 

The  non-negligible  probability  of  long-lasting  competitions  has  far-reaching  impli¬ 
cations.  In  the  absence  of  CA,  it  takes  very  little  time  for  the  fittest  agent  to  establish 
dominance,  so  it  is  often  reasonable  to  neglect  the  possibility  of  a  premature  burnout. 
Such  observations  are  in  hand  with  the  “survival  of  the  fittest”  principle,  since  soon 
enough  the  more  skilled  will  prevail.  In  the  presence  of  CA,  however,  even  agents 
with  superior  fitness  may  face  the  challenge  of  having  to  endure  extremely  long  com¬ 
petitions.  This  challenge  becomes  all  more  real  when  the  fitness  superiority  is  only 
minimal.  Will  the  more  skilled  survive  the  seemingly  eternal  inferiority  during  the 
competition?  Linder  CA  time  becomes  a  central  issue,  with  delusion  becoming  re¬ 
ality  if  the  more  skilled  burns  out  during  a  long  struggle.  Thus,  in  the  face  of  CA, 


56 


the  fittest  survives  only  if  it  can  persist,  which  prompts  us  to  rename  the  principle 
“survival  of  the  fittest  and  persistent”  when  considering  CA  competitions. 

This  observation  may  also  shed  light  on  the  seemingly  inherent  difficulty  of  pre¬ 
dicting  success  in  real-world  competitions  by  observing  ongoing  sample  paths.  Dif¬ 
ferent  empirical  studies  have  alluded  to  this  problem  [67,  72,  69]  as  well  as  recent 
model  driven  studies  [21].  Since  direct  competitions  tend  to  be  relatively  very  short 
and  skill-differences  tend  to  be  small  it  is  well  very  possible  that  the  winner  when 
competition  ended  was  not  the  fittest,  adding  to  the  difficulty  of  making  accurate 
predictions. 
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CHAPTER  4 


INFORMATION  DISSEMINATION  IN  SOCIAL 
NETWORKS  UNDER  LIMITED  BUDGET  OF 

ATTENTION 


4.1  Introduction 

Information  dissemination  has  been  transformed  by  the  emergence  of  online  social 
networks  and  their  enthusiastic  adoption  by  users.  Users  rely  on  trust  relationships 
in  social  networks  for  accessing  information.  Relationships  form  on  the  basis  of  the 
quality  of  information  received,  and  in  turn  determine  the  speed  of  propagation  in 
the  network. 

The  literature  on  information  propagation  in  social  networks,  or  rumor  spread¬ 
ing  [64]  is  wide  and  varied;  we  mention  here  only  those  that  are  most  relevant  to 
the  work  in  this  chapter.  Previous  work  has  mostly  studied  the  propagation  of  a 
rumor  originating  at  a  given  source.  The  typical  model  is  the  randomized  broadcast 
model  [47]  which  is  carried  out  in  synchronized  rounds.  In  each  round,  each  user 
selects  a  neighbor  at  random  and  propagates  a  rumor.  The  spreading  mechanisms 
considered  have  been  broadly  of  three  types:  push  mechanisms  where  the  user  sends 
the  rumor  if  he  has  it,  to  the  chosen  neighbor;  pull  mechanisms  where  the  user  pulls 
the  rumor  from  the  chosen  neighbor;  and  a  combined  mechanism,  where  the  user 
pushes  the  rumor  if  he  has  it  and  pulls  it  if  the  chosen  neighbor  has  it.  Most  previous 
work  focus  on  the  characterization  of  the  delay  in  spreading  a  single  rumor  to  all  nodes 
(e.g.  [20],  [17]).  An  asynchronous  model  is  considered  by  Arnini  et  al.  [4],  where  each 
node  contacts  a  neighbor  after  a  random  amount  of  time;  they  focus  in  particular  on 
random  regular  graphs  and  derive  performance  results  in  terms  of  optimal  delay. 


What  has  been  overlooked  in  most  previous  work  is  the  practical  constraint  that 
each  user  has  a  limited  budget  of  attention,  that  is,  a  limited  frequency  with  which 
he  interacts  with  his  neighbors.  Each  user  then  has  to  allocate  his  limited  budget 
of  attention  among  his  neighbors.  There  is  some  work  on  analyzing  the  allocation 
patterns  using  real  world  data.  In  particular,  Backstrom  et  al.  [8]  analyze  a  data 
set  of  real  measurements  on  how  users  split  their  attention  among  their  friends  on 
Facebook.  They  consider  activities  such  as  communication  and  viewing  and  show  how 
the  balance  of  attention  varies  across  activities  and  other  personal  characteristics. 
Since  the  level  of  attention  affects  information  flow  [49] ,  the  allocation  of  the  budgets 
of  attention  will  have  an  impact  on  information  propagation. 

This  chapter  considers  a  scenario  of  information  propagation  where  there  are  mul¬ 
tiple  information  sources  in  the  network  and  each  user  has  a  limited  budget  of  at¬ 
tention.  Each  user  allocates  his  budget  strategically  with  the  objective  of  timely 
reception  of  news.  We  consider  an  asynchronous  pull  model,  where  each  node  con¬ 
tacts  a  neighbor  after  a  random  delay  and  pulls  any  content  available  at  that  neighbor 
and  the  frequency  with  which  he  pulls  content  from  neighbors  is  limited.  Our  ob¬ 
jective  is  to  identify  optimal  allocations  of  this  limited  frequency  among  neighbors 
for  each  user  in  the  network  and  to  inform  the  design  of  algorithms  for  optimal  in¬ 
formation  spread.  In  particular,  we  want  to  answer  the  question  “when  users  make 
selfish  decisions  on  how  to  allocate  their  limited  access  frequency  among  neighbors, 
does  information  propagate  efficiently?”  We  take  the  approach  of  conceiving  a  gen¬ 
eral  model  for  studying  the  balance  of  attention,  and  an  analysis  for  several  network 
topologies. 

We  investigate  the  efficiency  of  selfish  allocation  by  considering  the  metric  of 
average  end-to-end  delay  of  content  spread.  We  make  the  following  contributions: 

•  We  study  the  efficiency  of  selfish  allocation  of  the  budget  of  attention  by  char¬ 
acterizing  the  price  of  stability  (PoS)  for  several  network  topologies. 
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•  We  identify  topologies  with  inefficient  propagation  under  selfish  allocations. 

•  We  propose  the  “plus-one”  mechanism,  an  incentive  scheme  that  coaxes  users 
into  mimicking  a  gradient-descent  algorithm,  bringing  the  cost  of  content  prop¬ 
agation  closer  to  the  optimal. 

•  We  present  numerical  results  that  compare  the  optimal,  selfish,  and  feedback- 
based  allocations. 

The  rest  of  this  chapter  is  organized  as  follows.  We  define  the  social  network 
model  in  Section  4.2,  and  present  the  analysis  of  optimal  and  distributed  allocation 
over  several  topologies  in  Section  4.3.  In  Section  4.4  we  present  our  feedback-based 
mechanism.  Numerical  results  follow  in  Section  4.5  and  we  conclude  in  Section  4.6. 

4.2  Model  Description 

We  consider  a  social  network  where  each  user  has  a  set  of  friends,  or  contacts,  that 
he  links  to,  or  follows,  in  order  to  get  content  such  as  news  updates,  videos,  or  other 
messages.  Each  user  then  makes  all  content  that  he  holds  available  to  his  followers, 
such  as  on  the  Facebook  wall  or  Twitter  stream.  Rather  than  a  single  source  of 
content,  we  allow  all  users  to  create  content,  at  a  given  rate.  Further,  we  assume 
that  users  seek  to  obtain  all  content  circulating  in  the  network.  Users  consult  their 
contacts  for  the  latest  updates  of  information  with  the  objective  of  minimizing  the 
average  delay  for  obtaining  all  information.  As  in  real  online  social  networks,  users 
have  a  limited  budget  of  attention,  that  is,  the  total  rate  at  which  they  may  consult 
their  contacts  is  limited.  As  such,  this  rate  must  be  allocated  among  the  contacts 
in  a  manner  that  optimizes  for  delay.  We  will  compare  a  centralized  optimization 
of  consultation  rates  with  a  distributed  one  where  users  optimize  the  allocation  in  a 
selfish  manner. 
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We  model  the  social  network  as  a  directed  graph  G  =  ( V,  E ),  where  V  is  the  set 
of  users,  and  E  is  the  set  of  directed  edges  between  users,  i.e.  (i,j)  G  E  if  and  only 
if  there  is  an  edge  from  i  to  j.  Denote  by  N(i)  the  set  of  in-neighbors  of  user  i,  i.e. 
N(i)  =  {j  G  V  :  ( j,i )  G  E},  and  let  d*  =  \N(i)\  be  the  in-degree  of  i.  We  assume 
that  G  is  strongly  connected. 

Each  user  i  G  V  creates  contents  according  to  a  Poisson  process  with  rate  A i  >  0. 
When  the  A,’s  are  the  same,  we  denote  the  common  value  by  A.  User  i  consults  his 
in-neighbor  j  G  N(i)  according  to  a  Poisson  process  with  rate  x^.  Each  user  has  a 
limited  budget  of  attention  6j  >  0,  so  that  the  rates  of  consultation  are  constrained 
by  YljeNU)  xji  =  hi-  When  the  bi  s  are  the  same,  we  denote  the  common  value  by  b. 
In  terms  of  the  normalized  rates  ijji  =  Xji/bi ,  the  budget  constraint  becomes 

yn  =  L  (4-1) 

jeN(i) 

The  vector  y =  {yri  :  j  G  N(i)}  represents  how  user  i  allocates  his  budget  of  attention 

among  his  neighbors,  which  will  be  referred  to  as  his  strategy.  The  set  of  all  possible 

strategies  of  user  i  is  the  unit  simplex  A*  in  Mdi.  Let  y  =  {y*  :  i  G  V}  be  the  strategy 
profile  of  the  network,  and  A  =  Xiey  \  is  fiie  set  °f  all  possible  prohles.  Also  let 
y _i  be  the  strategy  prohle  of  all  users  other  than  i. 

For  any  i  ^  j,  define  DJt(y)  to  be  the  delay  for  user  i  to  receive  content  originated 
at  user  j  under  prohle  y.  Define  the  cost  for  user  i  to  be 

Ci(y)  =  T-  J2  tE%(y).  (4-2) 

*  iev\{i} 

where  A _j  =  ^ji  ancl  define  the  social  cost  to  be 

=  E  VED3.(y),  (4.3) 

1  ’  iev  jev\{i} 
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where  A  =  ^2ieV  A*.  Note  that  both  (4.2)  and  (4.3)  are  independent  of  any  normal¬ 
ization  of  the  Aj’s,  which  just  translates  a  change  of  units.  We  set  A  =  1  when  content 
creation  rates  are  homogeneous. 

Let  C*  =  minygA  C{ y)  be  the  optimal  social  cost,  and  y*  a  profile  that  minimizes 
the  social  cost  C.  Let  T  C  A  be  the  set  of  profiles  that  are  Nash  equilibria  when 
each  user  minimizes  his  own  cost  in  a  selfish  manner.  In  general  there  might  exist 
multiple  equilibria.  A  standard  measure  of  inefficiency  of  equilibria  is  the  price  of 
anarchy  (PoA).  It  is  defined  as  the  ratio  of  the  cost  at  the  worst  equilibrium  to  that 
of  an  optimal  outcome,  i.e. ,  maxygr  C(y)/C*.  We  are,  however,  interested  in  the  best 
equilibria,  which  would  give  us  a  benchmark  of  what’s  achievable  through  distributed 
means.  As  such,  we  focus  on  the  price  of  stability  (PoS),  defined  as  the  ratio  of  the 
cost  at  the  best  equilibrium  to  that  of  an  optimal  outcome  [58],  i.e., 

PoS  =  min  (4.4) 

yer  C*C* 

where  C  =  minygrC'(y)  >  C*  is  the  minimum  social  cost  under  a  selfish  allocation. 
Even  though  the  PoS  can  be  seen  as  a  weaker  notion  of  inefficiency,  we  fold  it  more 
interesting  in  a  practical  sense,  since  it  gives  us  a  target  performance  for  the  design  of 
distributed  algorithms.  In  some  cases,  e.g.,  in  a  tree  network,  the  Nash  equilibrium 
is  unique,  so  the  PoS  coincides  with  the  PoA. 

In  what  follows,  we  will  measure  inefficiencies  in  the  selfish  allocation  of  the  budget 
of  attention  in  several  network  topologies.  We  will  show  that  some  topologies  lead 
to  large  inefficiencies.  In  Section  4.4  we  propose  a  feedback-based  mechanism  that 
results  in  a  distributed  allocation  that  has  a  cost  closer  to  the  optimal  cost  than  does 
the  selfish  allocation. 
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4.3  Efficiency  Analysis 

We  will  now  study  the  efficiency  of  selfish  optimization  on  several  social  network 
topologies.  The  interest  in  studying  various  topologies  lies  not  only  in  understanding 
how  existing  social  networks  with  those  topologies  perform,  but  also  in  identifying 
efficient  structures  for  information  dissemination.  The  latter  may  inform  smart  design 
for  information  propagation. 


4.3.1  Tree  Network 

We  first  consider  tree  topologies.  Such  structures  are  interesting  as  networks 
since  information  dissemination  can  be  locally  tree-like.  Let  G  be  a  tree;  for  all  tree 
networks  we  study,  we  will  assume  G  is  undirected.  Let  7}*  denote  the  component 
containing  j  when  an  edge  (j,  i)  G  E  is  removed.  Let  XJt  =  Y^keTji  ^ k  fiie  aggregate 
content  creation  rate  of  the  nodes  in  Tjt ,  and  =  | Tyi  |  the  number  of  nodes  in  . 

For  i  ^  j ,  let  Vj~>i  be  the  unique  shortest  path  from  j  to  i.  The  average  delay  for 
user  i  to  get  contents  originated  in  j  is  then  EDr,(y)  =  J2eeVj^.  xj1,  where  xe  =  xuw 
for  an  edge  e  =  ( u,w ).  Thus  the  cost  for  user  i  is 


Ci(y) 


A_ 


1  E  t 


e£Pj 


1 

A -A 


E 

keN(i) 


Xki 

Uki 


+  /(y-i)-, 


where  f(y~i)  represents  terms  that  do  not  depend  on  y,:.  Note  that  f(y~i)  can  be 
inhnity  for  some  y_*  on  the  boundary  of  A _j.  However,  since  users  are  trying  to 
collect  all  the  information,  even  selfish  users  have  incentives  to  keep  y  in  the  interior 
of  A  so  that  no  information  pathway  is  effectively  cut  off.  Therefore,  we  will  assume 
y  is  in  the  interior  of  A,  and  consequently  f(y~i)  is  finite. 

If  user  %  selfishly  minimizes  C'j,  the  unique  best  rate  allocation,  irrespective  of  how 
others  allocate  their  budgets  of  attention,  is  given  by 
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Vji  =  ^  j^r— ,  for  j  e  N (i).  (4.5) 

2^  V  'Ui 

keN(i) 

Note  that  in  this  case  each  user  has  a  unique  best  selfish  strategy  independent  of 
others’  strategies.  Thus  there  is  a  unique  Nash  equilibrium  and  the  PoS  coincides 
with  the  PoA. 

The  social  cost  can  be  written  as  follows, 


C  = 


1  E  1 


(n  —  1)A  z 


i£V  j:i^j  eeVj 


XP 


(n  —  1)A  Z 

(u,w)GE  i€.V  j:i^j 

1  \  A  'fl'WU^UW 

(n-1) A  ,  VuwK 

(u,w)£E 

1  \  -v  1  \  -v  TlikXki 

(n  —  1)A  ^  bi  yki 

v  ’  iev  *  keN(i)  y 


Thus  the  social  cost  under  selfish  allocation  (4.5)  is  given  by 


C  -  („  _  1)A  E  bj  I  E  v  'V  IIZ.V  "1. 

V  J  i€V  1  VjGJV(i)  /  \j£N(i) 


E 


(4.6) 


However,  the  socially  optimal  rate  allocation  is  given  by 


Vji 


\Zriij  Xji 


y!  A 

fceW(i) 


,  for  j  6  Af(i), 


(4.7) 


with  the  optimal  social  cost  being 


C*  (n  _  i)A  Z  6  (  Z 

v  '  ier  1  Vjeivp) 


(4.8) 
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We  will  now  study  in  more  detail  specific  tree  structures:  the  line,  the  k- ary  tree, 


and  the  chained  star  networks. 


4. 3. 1.1  Line  Network 

Suppose  G  is  a  line  network,  with  V  =  {l,...,n}  and  (i,i  +  1)  e  E  for  i  = 
1,  2, . . . ,  n  —  1  and  (i,  %  —  1)  E  E  for  i  =  2, ,  n.  Theorem  4.3.1  below  gives  bounds 
on  the  range  of  C*  and  C  and  an  upper  bound  on  the  PoS.  Note  that  the  upper 
bound  on  PoS  does  not  depend  on  the  content  creation  rates  A,.  When  the  budgets 
of  attention  are  homogeneous,  we  have  PoS  <  5,  but  C*  =  @(n/6). 

Theorem  4.3.1.  In  a  line  network  of  n  >  2  nodes,  the  optimal  social  cost  and  the 
cost  under  selfish  allocation  are  bounded  as  follows, 


ra+1 

8^max 


<  c*  <  c  < 


71+1 

^min 


9  1  A 

8  +  4  n  -  4  )  ’ 


(4.9) 


and 

PoS  <  (4.10) 

^min 

where  6max  =  max*  bi  and  bmin  =  min* 

Proof.  See  Appendix  D.l.  □ 

When  the  budgets  of  attention  are  heterogeneous,  the  upper  bound  in  (4.10)  can 
become  arbitrarily  large.  In  this  case,  the  PoS  can  be  arbitrarily  large  as  well,  as 
shown  in  Theorem  4.3.2. 

Theorem  4.3.2.  The  PoS  can  be  arbitrarily  large  when  the  budgets  of  attention  are 
heterogeneous  in  a  line  network. 

Proof.  See  Appendix  D.2.  □ 
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4.3. 1.2  Chained  Star  Network 


Consider  k  star  networks,  each  with  p  nodes.  The  hubs  of  the  stars  are  chained 
to  form  a  line  network,  with  a  total  of  n  =  pk  nodes.  Such  topologies  are  not 
uncommon  in  social  networks  based  on  communities.  Such  structure  might  correspond 
to  communities  focused  on  given  topics  or  interests,  that  are  then  connected  to  the 
larger  social  network.  Theorem  4.3.3  shows  that  while  the  optimal  social  cost  can  be 
large,  the  PoS  is  of  order  1  in  the  homogeneous  case. 

Theorem  4.3.3.  In  a  chained  star  network  with  n  users,  homogeneous  content  cre¬ 
ation  rates  A  =  1  and  homogeneous  budgets  of  attention  b,  the  optimal  social  cost  and 
cost  under  selfish  allocation  satisfy 

C*  =  0  (6'1  maxjp,  k})  =  Vl^b^^/n), 

C  =  ©  (6_1  maxjp,  k })  , 

with  PoS  =  0(1). 

Proof.  See  Appendix  D.3.  □ 

Remark.  Suppose  6=1.  As  p  changes  from  0(1)  to  @(n),  C*  can  have  any  order 
between  0(i/n)  and  0(n).  In  particular,  C*  =  O(n)  for  p  =  1  and  p  =  n,  which 
correspond  to  the  line  and  star  networks,  respectively. 

4.3. 1.3  k- ary  Tree  Network 

We  now  consider  rooted  trees  where  each  node  has  k  children.  Such  structures 
are  of  interest  for  social  networks  with  few  edges,  as  sparse  random  graphs  are  locally 
tree-like.  Theorem  4.3.4  below  states  the  PoS  for  k- ary  trees.  Corollary  4.3.5  shows 
that  this  PoS  can  be  arbitrarily  large  for  k  of  constant  order  and  as  k  scales  sublinearly 
with  n,  even  if  the  content  creation  rates  and  budgets  of  attention  are  homogeneous. 
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Theorem  4.3.4.  In  a  k-ary  complete  tree  with  n  users,  homogeneous  content  creation 
rates  A  =  1  and  homogeneous  budgets  of  attention  b,  the  optimal  social  cost  and  the 
cost  under  selfish  allocation  satisfy 


C*  =  ©  (b~1klogkn)  , 


C  =  0  {b  1k  logfc  n  +  b  l\/n)  , 


with 


PoS 


C 

c* 


0 


Proof.  See  Appendix  D.4. 


□ 


Corollary  4.3.5.  Let  b  =  1  and  k  =  0(na). 

(1) .  If  a  =  0,  i.e.  k  =  0(1),  the  costs  are  C*  =  0(logn),  C  =  @(>/n),  and 

PoS  =  0(i/n/logn). 

(2) .  7/0  <  a  <  1/2,  the  costs  are  C*  =  0(nQ),  C  =  and  PoS  =  0  (n1/2'") . 

(3) .  7/a  >  1/2,  the  costs  are  C*  =  0(n"),  C  =  @(nQ),  and  PoS  =  0(1). 

The  above  corollary  verifies  the  intuition  that  long  thin  networks  are  less  efficient 
for  information  propagation  than  wide  networks.  The  former  type  of  network  thus 
would  require  an  incentive-based  mechanism  to  make  them  more  efficient.  We  will 
propose  one  such  mechanism  in  Section  4.4. 


4.3.2  Clique  Networks 

We  now  consider  the  case  where  G  is  a  clique.  This  is,  in  some  sense,  the  best- 
case  scenario,  where  users  have  the  widest  choice  possible  in  allocating  their  budget 
of  attention.  The  analysis  is  more  involved  since  there  are  many  paths  between  each 
source-destination  pair,  creating  a  more  complicated  dependency  structure  between 
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the  various  links.  We  consider  efficiency  for  the  homogeneous  case,  where  A*  =  1  and 
bi  =  b.  We  will  show  that  there  exists  a  selfish  profile  that  is  asymptotically  optimal, 
and  thus  the  price  of  stability  is  bounded  and  approaches  one  as  the  network  size 
increases. 

Let  us  consider  the  uniform  strategy  for  user  i  G  V,  where  the  consultation  rate 
is  Uji  —  uji  —  b/di  for  all  j  G  N(i).  Let  rq  =  {uji  :  j  G  N(i)}  and  u_j  =  {iij  :  j  € 
V  \  {i}}.  The  profile  u  =  {iij  :  %  G  V}  is  referred  to  as  the  uniform  prohle.  For  a 
clique  network,  Uji  =  b/(n  —  1)  for  all  i  ^  j. 

Theorem  4.3.6.  For  an  n-clique  with  homogenous  content  creation  rates  A  =  1  and 
budgets  of  attention  b,  the  social  cost  Cu  of  the  uniform  profile  is  Cu  =  b~lF[n_  1} 
where  Hn  =  Y^k=i  ,  the  n-th  harmonic  number. 

Proof.  We  follow  the  approach  in  [41].  Let  be  the  delay  for  a  content  originating 
from  j  to  reach  at  least  m  other  users,  i.e.  Dj(m)  is  the  m~ th  order  statistic  of 
{Dji  :  i  G  V  \  {j}}  for  a  given  j.  Let  Vjk  =  Dj(k)  —  ^j{k- 1),  with  the  convention  that 
Dj(o)  =  0.  Thus 

n— 1  n— 1  m  n— 1 

E  Di‘  =  E  B*-”>  =  E  E  v*  =  E<"  -  w*-  (4-n) 

m=  1  m=  1  fc=l  fc=l 

Note  that  Vjk  is  exponentially  distributed  with  parameter  -^k(n  —  k ).  Taking  ex¬ 
pectations  of  (4.11)  we  get: 

n— 1  n— 1  _  ^ 

X  el>*  =  E(«  -  k)EV =  E  ^ 

k=  1  k= 1 

Summing  over  j  and  dividing  by  n(n— 1),  we  obtain  the  social  cost  Cu  =  b~1Hn_i.  □ 

The  next  theorem  shows  that  the  uniform  stratety  by  a  given  user  is  the  best 
response  when  all  other  nodes  follow  the  uniform  strategy,  showing  that  this  is  a 
Nash  equilibrium. 


Theorem  4.3.7.  For  an  n-clique  with  homogenous  content  creation  rates  1  and  bud¬ 
gets  of  attention  b,  the  uniform  profile  is  a  Nash  equilibrium. 

Proof.  Suppose  users  i  =  1,2, . . .  ,n  —  1  follow  the  uniform  strategy  and  consider  user 
n.  For  a  given  j, 

Djn{ y„,  u_n)  =  min  {  —Xkn  +  Ynk  \  , 

1 <k<n  1  ^  ykn  J 

where  {XknYlz\  are  i.i.d.  exponential  random  variable  with  parameter  b,  and  Yfl 
is  the  time  for  an  item  originating  from  j  to  reach  k  without  passing  through  n. 
Note  that  are  independent  of  {Yfl\f=l .  Let  T”  =  be  the  or¬ 

der  statistics  of  Y”  =  {Y'fk}nkz\ .  By  symmetry,  the  random  vectors  {Yf[, . . . ,  Yff_ j ) 
and  (Tfa(]  y  . . . ,  Tfa(n_yf  are  identically  distributed,  where  a  is  a  permutation  chosen 
uniformly  randomly  from  the  symmetric  group  Sn_ i ,  independently  of  T".  Therefore, 


P[Djn(yn,u_n)  >  x  |  Tf 


=  P 


n—  1 


=  E 


,k= 1 


exp 


n  {Xu  >  »„(*  - 1™  w)} 

n—  1 

b  ^  ^  VkiiiF  -E/'<r(fc)) 

fc=l 

>  exp  <j  -b  y^  yknEa(x  -  Tja(k))+ 


+ 


n— 1 


k=  1 

n—  1  n— 1 


=  exH  -bJ^UKnJ^  — 


k= 1  Z=1 

n— 1 


n  —  1 


r,,)- 


«=1 


=  P  [T>jn(un,  u_n)  >  x  |  T?]  , 


where  the  inequality  follows  from  Jensen’s  inequality.  Therefore,  for  j  =  1,2, ... ,  n—1, 


E  [Djn(yn,  U_n)  >  x]  >  P  [Djn(un,  U_n)  >  X  . 
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Integrating  over  x  from  0  to  oo,  we  obtain  EDjn(yn,  u_„)  >  EDjn(un ,  u_n)  and  hence 
Cn( y„,  u_n)  >  Cn( un,  u_n)  for  any  yn  G  An.  Therefore,  the  uniform  profile  is  a  Nash 
equilibrium.  □ 

The  next  theorem  shows  that  the  cost  of  the  uniform  profile  is  larger  than  the 
optimal  social  cost  by  at  most  1/6.  Thus  the  price  of  stability  approaches  one  as  the 
network  size  increases. 


Theorem  4.3.8.  In  any  network  of  size  n  where  every  user  publishes  at  rate  A  =  1 
and  has  a  budget  of  attention  b,  the  optimal  social  cost  for  any  user  i  is  lower  bounded 


by 


C*  >  - - -Hn_ i 

b(n  —  1) 


T  >  CV 
b 


1 

b' 


Proof.  The  argument  uses  a  backwards  growth  process  similar  to  that  used  in  Section 
V  of  [39].  Assume  the  process  is  in  steady  state,  i.e.  the  process  started  at  —  oo. 
Consider  user  i.  Let  £>*(£)  be  the  set  of  users  whose  states  at  time  —t  reach  user  i  by 
tirneO.  Note  that  -Bj(O)  =  {i}.  By  stationarity  and  the  independence  of  the  publishing 
and  consulting  processes,  P [Dji  >  t\  =  P[j  f  B^t)].  Now  let  Ajt  =  inf {t  :  j  e  Bft)}. 
Note  that  {j  ^  B^t)}  =  {A^  >  t}.  Thus  P [D^  >  t\  =  P[AJt  >  t],  i.e.  Drt  and 
Aji  are  identically  distributed.  Now  let  A^)i  =  inf {t  :  \Bi{t)\  —  k  +  1}.  Note  that 
:  1  <  k  <  n  —  1}  are  the  order  statistics  of  {Aji  :  j  e  {1, . . .  n}  \  {i}}. 

If  we  reverse  the  arrow  of  time  at  time  0,  then  Bi(t)  is  the  set  of  infected  users  at 
time  s  in  the  SI  epidemic  spreading  model  where  an  infected  user  j  contaminates  a 
susceptible  user  I  at  rate  by^j.  It  then  follows  that  A  A^kg  —  A^k_ig  is  exponen¬ 
tially  distributed  with  parameter  pki  given  by 


A ffc*  =  b  ^2  Vjt  <  6|5i(^4(fc_i)i)|  =  bk, 
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and  hence  E Wm  >  (bk)  1.  It  follows  that 

Ci  =  — -  e  =  — 7  E 

n  —  1  n  —  1 

r-3&  j'-jfr 

^  n— 1  ^  n—1  k 

=  — t  E  EAw  =  — 7  E 

n  —  l  -<L—'  n  —  1 

k=  1  fc=l  e=l 

>  1  w  1  _  n  H  _I 

~  n  —  1  bin  —  1)  n  1  b 

k= 1  e=l  v  ' 

The  desired  result  follows  by  averaging  over  Ci.  □ 

4.3.3  Expander  Network 

We  now  consider  a  network  characterized  by  an  expander  graph,  which  might  be 
considered  more  realistic.  An  expander  graph  is  a  sparse  graph  with  strong  connec¬ 
tivity  properties.  An  example  is  a  d-regular  graph  which  is  often  used  in  modeling 
social  networks. 

Finding  a  Nash  equilibrium  when  the  topology  is  an  expander  graph  turns  out  to 
be  quite  complex.  We  thus  consider  approximate  Nash  equilibria.  We  define  a  user’s 
strategy  to  be  an  e-approximate  NE  if  the  cost  to  the  user  under  this  strategy  is  no 
worse  than  e  more  than  the  cost  of  any  other  strategy  [58] .  More  formally,  the  profile 
y  is  an  e-approximate  NE  if  for  any  k  and  any  y*,  e  A*,, 


Cfc(y)  <  e  +  Cfc(yfc,y_fc). 

The  e-approximate  price  of  stability  is  defined  by  (4.4)  with  T  replaced  by  Te,  the  set 
of  e-approximate  NE. 

Suppose  G  is  an  expander  network  with  bounded  degree.  We  now  show  that  the 
profile  where  users  implement  a  uniform  allocation  of  their  budget  of  attention  is  an 
approximate  NE. 


71 


Theorem  4.3.9.  In  any  network  with  homogeneous  content  creation  rates  A  and 
budgets  of  attention  b,  the  uniform  profile  is  a  YY. approximate  Nash  equilibrium, 
where  d  =  max*,  dk  is  the  maximum  degree  of  the  graph. 

Proof.  Consider  user  I.  For  a  given  j, 


Djt{  yt,y~e) 


min 

keN(£) 


1 

VU 


X 


U 


+  Yfk 
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where  {Xu}  are  i.i.d.  exponential  random  variable  with  parameter  b ,  and  Y-k  is  the 
time  for  an  item  originating  from  j  to  reach  k  without  passing  through  t.  Define  a 
random  variable  K  by  K  =  min{fc*  :  Y-'k*  =  min*.  Yjk}.  Since  {Yjk}  are  independent 
of  { Xu },  so  is  K.  Thus 


EXKe  =  £  nXkt]  ■  V[K  —  k]  —  b~\ 

k 


We  then  have  the  following, 


ED^(u^,y_€) 


E 


min 

k&N(i) 


{  diXk£ 


+  4) 


<  E  d(XK( 


min  YL 

k£N(£)  J 


On  the  other  hand, 


^Dje(ye,y-i) 


E 


min 

k&N(t) 


>  E 


min 

keN{l) 


—Xu  +  Yk 
yu 


— Xu 

yu 


+  min  Yfk 

k&N{t)  J 


1  ^ 
7  +  E 
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min  Y 


k&N(l) 


jk 


Thus 


d  1 

EDji(ut,y-i)  <  - \-EDji{yt,y-t). 

Summing  over  j  and  then  minimizing  over  y^,  we  obtain 


Ce{ ut,y-t)  < 


dt  —  1 


b 


+  min  Ce( yt,y~i), 

ye£Ae 
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for  any  y_*. 


□ 


In  view  of  Theorem  4.3.8,  the  next  theorem  shows  that,  for  an  expander  network 
with  edge  expansion  bounded  away  from  zero,  the  uniform  profile  is  order  optimal, 
and  hence  the  (/-approximate  price  of  stability  is  bounded. 

Theorem  4.3.10.  The  social  cost  Cu  of  the  uniform  profile  is  bounded  by 


Cu  < 


2d 


bk 


H 


G 


K2J  i 


where  d  =  max*  di,  and  he  is  the  edge  expansion  of  G  defined  by 

,  •  \dA\ 

hG  =  nun  .  r  |  .  |  |  .  i  , 

|A|cv  nnn{|H|,  |HC|} 

with  d A  =  {(u,v)  G  E  :  u  e  A,  v  G  Ac}. 

Proof.  The  proof  is  essentially  the  same  as  that  for  Theorem  4.3.6.  The  difference  is 
that  the  exponential  random  variable  Vjk  in  (4.11)  now  has  parameter  /ijfc  given  by 

hjk  =  b  ^  ijuv  >  ~f\dFjk\  -  ~f  niin{/e,  n  -  k}, 

where  Fjk  is  the  set  consisting  of  the  first  k  users  that  has  got  the  content  originating 
from  j,  with  Fi  i  =  {j}.  Hence 


n— 1 


(n  —  1  )d 


N  E Dji  <  \  - 

bhG  mini k,  n  —  k\ 

iii^j  k= 1 


< 


2  d{n  —  1) 


bh 


H\ 


G 


Ln/2J  ■ 


The  desired  result  follows  by  summing  over  j  and  dividing  by  n{n  —  1). 


□ 
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4.4  Incentivizing  Efficient  Behavior 

As  we  have  just  seen,  topologies  can  be  classified  according  to  their  performance 
under  optimal  and  selfish  allocations  as 

(1) .  Efficient:  These  topologies  have  bounded  PoS  and  optimal  social  cost  of  order 

log  n;  they  do  not  require  additional  mechanisms.  Examples  are  expanders  and 
cliques. 

(2) .  Inefficient  amenable:  These  topologies  have  high  PoS  yet  low  (logarithmic) 

optimal  social  cost.  As  we  shall  show,  incentive  mechanisms  can  be  introduced 
to  change  users’  behavior  and  reduce  their  otherwise  inefficient  performance. 
Examples  are  the  k- ary  tree  with  bounded  degree  (k  =  ©(1)),  and  with  low- 
scaling  degree  ( k  =  Q(na),a  <C  1/2). 

(3) .  Inefficient  suboptimal:  These  topologies  show  inefficient  content  spread  even  un¬ 

der  socially  optimal  allocations.  No  mechanism  that  preserves  the  topology  and 
the  budgets  of  attention  can  therefore  lead  to  good  performance.  Examples  are 
line  and  star  networks,  and  /c-ary  trees  with  high-scaling  degree  ( k  =  Q(na),a 
close  to  1/2  and  a  >  1/2). 

We  now  propose  an  incentive  mechanism  that  will  prove  particularly  appealing  for 
inefficient  amenable  graphs. 

4.4.1  The  Plus-One  mechanism 

Our  mechanism  relies  on  using  incentives  as  a  form  of  feedback  for  reallocating 
attention.  Each  receiver  k,  upon  receiving  useful  information,  sends  a  reward  of  1, 
that  we  call  a  +1,  to  each  node  involved  in  relaying  this  information  from  its  source  s. 
Note  that  by  useful  information,  we  mean  that  the  piece  of  information  that  arrived 
earliest  at  r,  among  all  copies  of  this  information  at  r.  Therefore,  a  node  sends  a  +1 
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to  the  neighbor  through  which  the  earliest  copy  was  received.  We  now  provide  details 
of  this  mechanism. 

•  Each  receiver  k,  upon  reception  of  useful  information  from  source  s,  sends  a  +1*. 
to  each  node  along  the  path  to  s  that  was  involved  in  relaying  that  piece  of  infor¬ 
mation.  Each  node  i  along  this  path  then  keeps  a  score  O*  =  k  (ji)ev* 
where  V*^k  is  the  quickest  (shortest)  path  from  s  to  k.  Note  V*^k  is  random 
and  differs  from  one  sample  path  to  another. 

—  The  receiver  is  not  required  to  know  the  topology  of  the  network  nor  the 
path  to  each  source.  A  completely  distributed  implementation  consists  of 
each  receiver  k  sending  a  +1*,  to  the  neighbor  through  which  it  received  the 
useful  information  from  s.  Each  node  along  the  path  would  then  aggregate 
the  +ls  it  receives  along  with  its  own  +1  before  sending  the  sum  up  to  its 
neighbor. 

•  At  time  intervals  that  are  much  longer  than  the  slots  over  which  +ls  are  sent, 
each  user  i  updates  his  allocation  rates  as  follows, 

y,i (t  + 1)  =  »,(*)  -  7.  (m«)  -  .  (4.12) 

where 

Sji(t)  =  — 7 - \  ,  20), 

nyn  -  1  )yji{t)2  J 

and  with  such  that  YltLo  It  =  lim^oo  =  0. 

We  refer  to  the  +ls  as  incentives  because,  as  feedback,  they  represent  the  impor¬ 
tance  of  a  link,  thus  the  value  of  the  incentive  to  provide  to  bring  about  a  favorable 
change  in  that  link’s  allocation.  In  the  present  chapter  we  keep  these  incentives  in 
quite  general  forms,  but  they  may  be  regarded  as  monetary  or  non-monetary.  Non¬ 
monetary  incentives  might  include  a  form  of  reputation  or  recognition,  such  as  in 
networks  like  Klout  [2],  In  such  networks,  users  receive  votes  that  count  towards 
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their  reputation  or  expertise,  in  return  for  some  service  (like  answering  questions) 
they  provide  to  other  users.  A  gain  in  reputation  incites  users  to  respond  favor¬ 
ably  when  there  is  a  possibility  of  receiving  such  votes.  Note  that  such  a  method 
incentivizes  a  user  to  “serve”  other  users,  thus  going  beyond  a  selfish  allocation  of 
attention. 

The  behavior  induced  by  the  Plus-One  mechanism  turns  out  to  perform  a  stochas¬ 
tic  gradient  descent.  The  +ls  collected  by  a  node  that  correspond  to  a  link  e  indeed 
serve  to  estimate  the  gradient  of  the  cost  with  respect  to  the  allocation  on  link  e,  xe. 
We  now  show  how  <5™  is  an  estimate  of  the  gradient  of  the  cost:  4^-.  For  ease  of 
exposition  we  assume  homogeneous  budgets  of  attention  (A  =  1  for  all  i)  and  content 
creation  rates  (A*  =  1  for  all  i).  Let  {Xe  :  e  G  E}  be  a  collection  of  i.i.d.  exponential 
random  variable  with  parameter  1.  The  expected  delay  from  source  s  to  user  k  can 
be  written  as  follows, 


E.Dsfc(y)  =  E 


E  W 


Since  the  Xe’s  are  different  with  probability  one,  for  each  realization  of  the  AVs,  the 
shortest  path  V*^k  remains  fixed  for  small  enough  perturbations  in  the  ye’s.  The 
gradient  of  the  average  delay  from  s  to  k  with  respect  to  edge  (j,  i )  can  be  estimated 
as  follows, 
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where  is  the  indicator  of  A.  From  (4.3),  the  social  cost  is 


C{  y) 
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n(n  —  1) 


y:  e  Dsk. 

kEV  s:s^k 


Thus  the  gradient  of  the  overall  cost  is  then  given  by 
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An  estimator  8„  of  -jp-  can  then  be  written  as  follows, 

J  C'Vji 

^  n(n  -  1 

v  s,k 

Note  that  corresponds  exactly  to  Sji,  with  O*  =  J2sk  1  {j,i)&v*^k- 

A  study  of  the  convergence  properties  of  the  Plus-One  mechanism  requires  a  care¬ 
ful  analysis  of  the  interchange  of  expectation  and  differentiation,  which  will  not  be 
pursued  in  the  present  chapter.  Indeed  results  from  simulations  presented  in  Sec¬ 
tion  4.5  show  convergence  for  all  graphs  that  we  consider. 

4.4.2  Inefficient  Suboptimal  Graphs 

The  Plus-One  mechanism  performs  well  for  the  inefficient  amenable  graphs:  the 
PoS  is  reduced  and  the  cost  under  this  distributed  mechanism  is  very  close  to  optimal, 
as  we  will  see  in  Section  4.5.  For  inefficient  suboptimal  graphs,  however,  regardless 
of  the  PoS,  the  optimal  social  cost  is  still  quite  high.  Our  results  from  Section  4.3 
show  that  line  and  star  networks,  and  fc-ary  trees  with  k  =  0(nQ),a  >  0,  fall  in 
this  category  of  graphs.  For  such  topologies,  incentive  mechanisms  that  promote  only 
shifting  of  attention  are  not  sufficient.  More  complex  mechanisms  that  change  the 
graph  structure,  or  modify  the  budgets  of  attention  of  some  nodes  would  seem  more 
suitable.  We  leave  the  study  of  such  mechanisms  for  future  work. 

4.5  Simulation  Results 

We  now  perform  simulations  to  verify  our  efficiency  results  and  validate  the  Plus- 
One  (PO)  mechanism.  For  each  network  topology,  we  ran  a  discrete-event  simulation 
for  three  different  scenarios.  In  the  first  scenario,  the  PO  mechanism  is  implemented, 
so  that  the  users  are  incentivized  to  minimize  the  social  cost.  In  the  second  scenario, 
the  users  behave  selfishly,  so  they  optimize  only  for  their  own  cost.  The  tuning  of 
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their  allocation  of  attention  is  similar  to  the  PO  mechanism  except  that  since  there 
is  no  reward  from  the  downstream  users,  there  is  only  a  local  optimization.  In  the 
first  two  scenarios,  the  initial  allocation  of  attention  is  uniform  for  all  users,  which  is 
reasonable  without  prior  knowledge  of  the  network.  In  the  third  scenario,  the  users 
do  not  tune  their  allocation,  and  maintain  the  uniform  strategy. 

We  set  the  each  user’s  publishing  rate  A,  to  A  =  1  and  the  budget  of  attention  bi  to 
b  =  1.  The  users  update  their  allocation  every  100  time  units.  We  ran  the  simulation 
for  length  of  time  long  enough  so  that  the  network  reaches  steady  state  as  illustrated 
in  Figure  4.1,  which  shows  the  average  delay  over  time  in  a  complete  ternary  tree 
with  1093  nodes.  The  average  is  taken  within  a  window  of  100  time  units,  the  same 
as  the  interval  between  successive  updates.  We  observe  convergence  around  a  small 
set  of  values  in  all  scenarios  for  all  topologies  we  consider. 


Figure  4.1:  Average  delay  over  time  in  a  complete  ternary  tree  with  1093  nodes. 


We  now  study  the  steady-state  average  delay  for  various  network  topologies.  For 
almost  all  cases  we  will  plot  the  average  delay  derived  from  both  theoretical  and 
simulation  results.  Figure  4.2  plots  the  average  delay  over  increasing  network  size 
for  a  line  network.  The  PO  mechanism  indeed  improves  upon  the  selfish  allocation, 
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achieving  a  cost  close  to  the  theoretical  optimal  social  cost.  Note  however  that  the 
optimal  social  cost  scales  linearly  with  the  network  size.  This  line  network  is  an 
example  of  inefficient  suboptimal  graphs  that  needs  additional  mechanisms  beyond 
PO  to  improve  the  linear  cost. 


Figure  4.2:  Average  delay  against  network  size  for  a  line  network. 

Figure  4.3  plots  the  average  delay  against  network  size  for  complete  k-aiy  trees 
with  k  —  4.  The  PO  mechanism  achieves  the  theoretical  optimal  social  cost,  which 
scales  logarithmically  with  the  network  size.  The  social  cost  of  the  uniform  strategy 
is  only  slightly  higher  than  the  optimal  social  cost,  though  the  gap  is  increasing  as  the 
network  size  increases.  In  contrast,  the  cost  for  the  selfish  optimization  is  significantly 
higher,  and  increases  much  faster  (~  n 1//2)  with  the  network  size.  This  topology  is  an 
example  of  an  inefficient  amenable  graph  shows  the  power  of  the  PO  mechanism,  in 
bringing  the  social  cost  down  to  one  very  close  to  the  optimal  cost. 

Figure  4.4  plots  the  average  delay  in  random  3-regular  networks.  Since  random 
d-regular  graphs  have  good  expansion  property  with  high  probability,  we  know  from 
Theorems  4.3.9  and  4.3.10  that  the  uniform  strategy  achieves  a  social  cost  that  scales 
logarithmically,  and  that  it  is  a  3-approximate  NE.  Figure  4.4  shows  that  the  costs 
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Figure  4.3:  Average  delay  against  network  size  for  complete  quaternary  tree. 


associated  with  PO,  selfish  optimization  and  the  uniform  strategy  actually  coincide 
for  the  network  scales  used  in  the  simulation.  This  expander  graph  is  an  example  of 
the  efficient  graphs,  where  the  diversity  of  paths  leads  to  optimal  social  costs  without 
incentive  mechanisms. 


4.6  Conclusion  and  Future  Work 

This  chapter  has  shown  that  social  network  topologies  can  be  categorized  into 
three  classes  according  to  efficiency  of  information  spreading:  efficient,  inefficient 
amenable,  and  inefficient  suboptimal.  This  chapter  has  also  proposed  the  Plus- 
One  mechanism,  an  incentive-based  mechanism  that  brings  the  costs  in  inefficient 
amenable  graphs  close  to  optimal.  Inefficient  suboptimal  graphs,  on  the  other  hand, 
are  resilient  to  our  mechanism,  in  that  the  cost  under  distributed  attention  allocation 
is  reduced  close  to  the  optimal  social  cost,  but  the  optimal  social  cost  itself  is  quite 
high.  For  such  graphs,  mechanisms  that  go  beyond  incentives  for  attention  shifting, 
those  that  induce  a  change  in  graph  structure  or  in  the  budgets  of  attention  are 
needed. 
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Figure  4.4:  Average  delay  against  network  size  for  random  3-regular  network. 


We  have  demonstrated  the  effectiveness  of  the  Plus-One  mechanism  by  simula¬ 
tion.  ffowever,  it  will  be  of  interest  to  have  a  formal  investigation  of  its  convergence 
property  and  provable  performance  guarantee. 

We  have  assumed  in  the  present  chapter  that  all  users  are  interested  in  receiving 
information  from  all  sources.  As  an  extension  of  the  present  chapter,  we  may  con¬ 
sider  a  more  interesting  and  realistic  case  where  users  have  differing  sets  of  interests. 
Mechanism  design  for  such  scenarios  is  decidedly  more  complex,  with  a  more  intricate 
contact  structure. 
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CHAPTER  5 


CONCLUSIONS  AND  FUTURE  WORK 


This  thesis  studied  various  aspects  of  networks  characteristics  and  dynamics,  with 
focus  on  reciprocity,  competition  and  information  dissemination. 

Chapter  2  investigated  the  maximum  reciprocity  problem  and  its  use  with  re¬ 
gard  to  the  interpretation  of  empirical  reciprocity  in  real  networks.  We  proposed  to 
interpret  empirical  reciprocity  based  on  its  comparison  with  the  maximum  possible 
reciprocity.  We  proved  that  the  maximum  reciprocity  problem  is  NP-hard,  so  we  did 
the  comparison  with  an  upper  bound  instead.  We  found  that  this  bound  is  surpris¬ 
ingly  close  to  the  empirical  reciprocity  in  a  wide  range  of  real  networks,  and  that 
there  is  a  surprisingly  strong  linear  relationship  between  the  two.  We  demonstrated 
that  a  particular  type  of  small  suboptimal  motifs  called  3-paths  are  the  major  cause 
for  suboptimality  in  real  networks. 

There  are  several  future  directions  related  to  Chapter  2.  Given  the  usefulness 
and  NP-hardness  of  the  maximum  reciprocity  problem,  it  is  of  interest  to  design  ap¬ 
proximation  algorithms  with  performance  guarantee.  The  linear  relationship  between 
empirical  reciprocity  and  the  upper  bound  is  intriguing  and  invites  a  careful  study 
for  its  explanation.  In  Chapter  2,  we  fixed  the  degree  sequence.  It  will  be  useful  to 
consider  small  perturbation  to  the  degree  sequence  and  study  the  robustness  of  the 
results.  We  focused  on  maximum  reciprocity  with  given  degree  constraints.  To  get 
a  complete  picture,  it  is  necessary  to  study  the  full  spectrum  of  reciprocity  and  in 
particular  its  minimum.  More  broadly,  there  are  many  other  network  characteristics 
of  interest,  whose  interdependence  is  worthy  of  investigation. 
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Chapter  3  analyzes  competition  dynamics  under  cumulative  advantage.  We  char¬ 
acterize  the  tail  distributions  of  duration  and  intensity  for  pairwise  competition  under 
cumulative  advantage.  We  demonstrate  that  duration  always  has  a  power-law  tail  ir¬ 
respective  of  competitors’  fitness,  while  intensity  has  either  a  power-law  tail  or  an 
exponential  tail  depending  on  whether  the  competitors  are  equally  fit.  We  observe 
the  struggle-of-the-fitness  phenomenon,  where  a  slight  different  in  fitness  results  in 
an  extremely  heavy  tail  of  duration  distribution. 

For  future  work,  it  is  of  interest  to  close  the  gap  between  the  asymptotic  upper 
and  lower  bounds  for  the  duration  distribution  when  the  competitors  are  not  equally 
fit.  Another  direction  is  to  extend  the  results  to  more  than  two  competitors  and 
eventually  to  the  full  network  setting. 

Chapter  4  studied  the  efficiency  of  information  dissemination  in  social  networks 
with  limited  budget  of  attention.  We  quantified  the  efficiency  of  information  dissem¬ 
ination  for  both  cooperative  and  selfish  user  behaviors  in  various  network  topologies. 
We  identified  topologies  where  cooperation  plays  a  critical  role  in  efficient  information 
propagation.  We  proposed  an  incentive  mechanism  called  “plus-one”  to  coax  users 
into  cooperation  in  such  cases,  and  demonstrated  its  effectiveness  through  simulation. 

Future  investigation  is  needed  for  the  convergence  property  and  provable  perfor¬ 
mance  guarantee  of  the  “plus-one”  mechanism.  An  analysis  of  power-law  networks 
will  be  be  valuable,  since  many  social  networks  exhibit  power-law  degree  distributions. 
Another  natural  extension  is  to  allow  users  to  have  differing  sets  of  interests. 


83 


APPENDIX  A 


ADDITIONAL  PROOFS  FOR  CHAPTER  2 

A.l  Proof  of  Theorem  2.3.7 

We  adapt  the  proof  for  Theorem  2.2  of  [16]  that  deals  with  packing  two  graphic 
sequences  for  undirected  graphs.  Without  loss  of  generality,  we  can  assume  that 
Vo  =  V,  since  removing  isolated  vertices  does  not  change  the  conclusion.  Assume 
that  conditions  (l)-(3)  hold  and  consider  the  set  Q  of  all  pairs  of  digraphs  (Gi,G2) 
such  that 

(i) .  G i  is  symmetric  with  degree  bi-sequence  (d°,d°), 

(ii) .  G2  has  degree  bi-sequence  (d+  —  d°,  d_  —  d_), 

(iii) .  the  union  G  =  G\  +  G2 ,  as  a  multi-digraph,  has  degree  bi-squence  (d+,d_). 

Note  that  G±  can  be  identified  with  an  undirected  graph  with  degree  sequence  d°. 
Conditions  (1)  and  (2)  guarantee  that  Q  ^  0.  Among  all  pairs  in  Q,  choose  a  pair 
(G\,G2)  such  that  the  number  of  shared  edges  \G\C\G2\  is  minimized.  We  will  show 
that  Gi  fl  G2  =  0,  so  their  union  G  =  G\  +  G2  is  a  realization  of  (d+,  d~)  and  hence 
P(d+,d")  >  p(G)  >  |  Ci  |  =  2  —  2m.  To  this  end,  we  will  show  that  condition 

(iii)  would  be  violated  if  G\  fl  G2  ^  0. 

Assume  there  exists  an  edge  (x,  y )  G  GinG2.  Since  G\  is  required  to  be  symmetric, 
for  the  sake  of  notational  simplicity,  we  will  use  the  same  notation  (a,  b )  for  a  single 
edge  to  refer  to  the  pair  of  edges  (a,  b )  and  (6,  a)  in  G i,  which  is  represented  pictorially 
by  an  undirected  edge. 
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x  -x -  y 


x  -  y 


x  -  y 


\  / 


v  X  w 

(a) 


v  -> <- w 
(b) 


♦X-  w 

(c) 


Figure  A.l:  Proof  of  Claims  1-3.  The  number  of  shared  edges  is  reduced  by  rewiring 
the  edges  marked  by  red  crosses  into  the  dashed  green  edges.  Each  undirected  edge 
represents  a  pair  of  reciprocated  edges. 


For  v  G  V,  let  Ng(v )  =  {u  :  (v,u)  G  G}  be  the  out-neighbors  of  v  in  G  and 
Nq(d)  =  { u  :  (u,v)  G  G}  the  in-neighbors  of  v  in  G.  Let  NG(v)  =  NG(v)  U  Ng(v) 
be  the  neighbors  of  v  in  G.  For  W  C  V,  let  NG(W)  =  [jweW  N£ (w),  Ng(W)  = 
Uwgw  Ng(w)  and  NG(W)  =  Nq(W )  U  iVG(W).  We  use  the  convention  jVG(0)  = 
iV-(0)  =  iVG(0)  =  0.  For  Vu  V2  C  V',  let  V1®V2  =  {(uj,  v2)  G  F,  x  P2  :  u,  ^  u2}. 

Now  consider  I  =  V  -  [NG(x)  U  7Vg(j/)].  Let  W1  =  NGl(I),  W2{I )  =  iV+2(J)  n 
iVG2(J),  W+  =  Nq2(I)  -  Nq2  (/)  and  W"  =  iVG2(/)  -  7V+(/).  Note  that  iVG(J)  = 
IF1  +  W2  +  1F+  +  W~. 

We  break  the  proof  into  several  claims. 

Claim  1.  W1  C  Ng(x)  D  NG(y). 

Proof.  Suppose  W1  ^  0.  Let  w  G  W1  and  v  G  I  such  that  (v,  w)  G  G±.  If  w  £  NG(x), 
then  G\  —  Gy  —  {(x,y),  (v,w)}  +  {(x,w),  {y,v)}  would  reduce  the  number  of  shared 
edges;  see  Figure  A. 1(a).  Therefore,  w  G  Ng(x).  Similarly,  w  G  Nc(y).  □ 

Claim  2.  N£2(I)  C  N+(x). 

Proof.  Suppose  Nq2(I)  ^  0.  Let  w  G  Nq2(I)  and  v  G  I  such  that  (v,w)  G  G2.  If 
w  ^  Nq(x),  then  G'2  =  G2  —  {(x,y),  (v,  ru)}  +  {(a;,  w),  ( v,y )}  would  reduce  the  number 
of  shared  edges;  see  Figure  A. 1(b).  Therefore,  w  G  NG(x).  □ 

Claim  3.  NgJI)  c  Nc(y). 
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x  -x -  y 


/  Wl - W2 

Zi  Z2 


(a)  wi,  w2  €  W1. 


x  -  y 


/  Wi---*W2 

L?K  SCi 

-1  ^2 


(b)  Ti,i€l\£a(J),u,2GJV+(I) 


G2' 


Figure  A. 2:  Proof  of  Claims  4-5.  The  number  of  shared  edges  is  reduced  by  rewiring 
the  edges  marked  by  red  crosses  into  the  dashed  green  edges.  Each  undirected  edge 
represents  a  pair  of  reciprocal  edges. 


Proof.  Suppose  Nq2(I )  0.  Let  w  £  Nq2(I )  and  v  £  I  such  that  (w,v)  £  G2.  If 
w  f:  Nf,(y),  then  G'2  =  G2  —  {(x,y),  (v,  w)}  +  {(x,  v),  ( w,y )}  would  reduce  the  number 
of  shared  edges;  see  Figure  A. 1(c).  Therefore,  w  £  NG(y).  □ 

ft  follows  from  Claims  2  and  3  that  W2  C  Nq  (x)  D  NG(y),  W+  C  NG{x)  and 
W~  C  Nf,(y).  Note  that  NG(I )  C  NG(x )  U  NG(y)  =  Ic.  As  a  result,  there  is  no  edge 
with  both  ends  in  I. 

Claim  4.  There  exists  an  edge  in  G  between  every  pair  of  distinct  vertices  in  W 1 . 

Proof.  Let  wi,w2  £  W1  and  zi,z2  £  /  such  that  ( Zi,Wi )  £  Gi  for  i  =  1,2,  where 
are  02  are  not  necessarily  distinct.  If  (wi,w2)  ^  G  and  (w2,w  1)  f  G,  then  G\  = 
Gi  —  {(x,  y),  (wi,  01),  (w2,  02)}  +  {(x,  01),  (y,  z2),  (w±,  w2)}  would  reduce  the  number  of 
shared  edges;  see  Figure  A. 2(a).  Therefore,  either  (wi,w2)  £  G  or  (w2,w  1)  £  G.  □ 

Claim  5.  NG2(I)  ®  iV+2(/)  C  G. 

Proof.  Let  (wi,w2)  £  Nq2(I )  <8>  Nq2(I).  Let  01,02  £  /  such  that  (iui,0i)  £  G2 
and  (z2,w2)  £  G2,  where  01  are  02  are  not  necessarily  distinct.  If  (wi,w2)  f  G, 
then  G'2  =  G2  -  {(x,y),  (wi,0i),(z2,w2)}  +  {(x,  01),  (02,  r/),  (wi,  w2)}  would  reduce 
the  number  of  shared  edges;  see  Figure  A. 2(b).  Therefore,  (wi,w2)  £  G.  □ 


As  a  result  of  Claim  5,  W2  <g)  W2  C  G  and  W  ®  W+  C  G. 


x  -x-  y 


r 

x  -  2/ 


y  - x 


W l  w2 


Wl  ^2 


w  1  -X-K  W2 


if  ifti  U!  Aj  u?  AJ 

*2  ^  A  '  Z2  Z3  Z!  Z3  Z2 

(a)  (wl,w2),(w2,w1)  $  G.  (b)  (wuw2)  £  G,(w2,wi)  G  G.  (c)  {wuw2)  G  G,(u72,u;i)  $.  G. 


Figure  A. 3:  Proof  of  Claim  6.  The  number  of  shared  edges  is  reduced  by  rewiring 
the  edges  marked  by  red  crosses  into  the  dashed  green  edges,  where  W\  G  W1  and 
w-2  G  W2.  Each  undirected  edge  represents  a  pair  of  reciprocal  edges. 


Claim  6.  W1  <g>  W2  C  G  and  W2  <g)  W1  C  G. 

Proof.  Let  W\  G  VF1,  w2  G  W 2  be  such  that  W\  ^  w2.  Let  z i,  z2,  z3  G  /  be  such  that 
(wi,Zi)  G  G i,  (z3,w2)  G  G-2  and  ( w2,z2 )  G  G2,  where  z3,z2,z3  are  not  necessarily 
distinct.  We  will  show  that  if  (wi,w2)  ^  G  or  (w2,wi)  ^  G,  we  would  be  able  to 
find  a  new  pair  of  graphs  (G[,  G'2)  G  Q  such  that  \G\  C\G'2\  <  |Gi  D  G2|,  which  would 
contradict  the  choice  of  (Gi,G2).  Consider  three  cases. 

(i) .  If  (wi,w2)  ^  G,  (w2,wi)  ^  G,  then  let 

Gi  =  Gi  -  {(a  i/),  (wi,  a)}  +  {(a?, *0,  (wi,  w2)}, 

G'2  =  G2  —  {(^3,U72),  (w2,£2)}  +  {(^3,2/),  (y,Z 2)}; 
see  Figure  A. 3 (a). 

(ii) .  If  (wi,w2)  f  G,  (w2,wi)  G  G,  then  let 

G\  =  Gi  -  {(tui,zi)}  +  {(wi,u;2)}, 

G2  =  G2  -  {(x,y),  (*3,w2),  (w2,u;i)}  +  {(z3,2/),  (m,«h)}; 
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see  Figure  A. 3(b). 


(iii).  If  (wi,w2)  G  G,  (w2,w1)  ^  G,  then  let 

G\  =  Gi  -  {(wi,  zi)}  +  {(wi,  w2)}, 

G'2  =  G2-  {(x,  y),  (wi,  w2),  [w2,  z2 )}  +  {(x,  z2),  (wlt  zi),  (zi,  x)}; 
see  Figure  A. 3(c). 

Therefore,  (wi,w2)  G  G,  (w2,w1)  G  G.  □ 

Claim  7.  A  >  3. 

Proof.  Note  that  A  >  2  since  df  +  d~  —  d°x  >  df  >  2.  If  A  =  2,  then  NG(x)  =  {y} 
and  NG(y )  =  {x}.  Thus  NG(I )  C  {x,y}.  But  x,y  ^  NG(I)  by  the  definition  of  /. 
Therefore,  NG(I)  =  0,  and  hence  =  df  —  0  for  every  v  G  /  =  V  —  {x,?/}.  It  then 
follows  that  5  =  A  =  2  and  n  =  2.  A  direct  calculation  shows  that  condition  (3)  is 
violated.  Therefore,  A  >  3.  □ 

Now  consider  the  cut  (/,  Jc).  Let  hF°  =  bF1  U  bF2.  Let  w°  =  |bF°|,  u>+  =  |bF+|, 
w~  =  |bF_|.  We  will  count  the  number  of  edges  across  the  cut  in  a  special  way.  An 
edge  from  G2  in  either  direction  is  counted  as  one  edge,  while  a  pair  of  reciprocal 
edges  from  G\  is  counted  as  one  edge.  Note  that  the  number  of  edges  across  the  cut 
is  bounded  by 

E(I,Ic)>8\I\=5(n-\P\), 

since  each  vertex  in  /  contributes  at  least  5  edges.  Note  that  |Ag(x)|  <  A  —  1  and 
|IVg(?/)|  <  A  — 1,  since  the  edge  (x,y)  has  multiplicity  2.  Since  bF°  C  NG(x)C\NG(y), 

|/c|  =  | Ng(x)  U  NG{y) |  =  |Ag(x)|  +  \NG{y)\  -  \ NG(x)  n  NG(y) \  <  2A  -  2  -w°, 


and  hence 


E(I,IC )  >  8{n  —  2A  +  2  +  w°). 

On  the  other  hand,  each  vertex  in  W°  must  connect  to  x,  y  and  every  other  vertex 
in  bh°,  and  hence  contributes  at  most  A  —  1  —  w°  edges  to  E(I,IC).  Each  vertex 
in  W+  must  connect  to  x  and  every  vertex  in  W~,  and  hence  contributes  at  most 
A  —  1  —  w~  edges  to  E(I,IC).  Similarly,  every  vertex  in  W~  contributes  at  most 
A  —  1  —  w+  edges  to  E(I,  Ic ).  Therefore, 

E(I,  Ic )  <  w°( A  —  1  —  w0)  +  w+( A  —  1  —  w~)  +w~( A  —  1  —  w+). 

Combining  the  two  inequalities  for  E(/,/c),  we  obtain 

f(w°,  w+,  w~ )  >  8(n  —  2A  +  2), 


where 

f(w°,  w+,  w~ )  =  w°( A  —  8  —  1  —  w° )  +  w+(A  —  1  —  w~ )  +  w~(A  —  1  —  w+ ). 

Note  that  w0  +  w+  <  |AG(a;)  —  {t/}|  <  A  —  2.  Similarly  w0  +  w~  <  A  —  2.  If  we 
maximize  /  subject  to  these  feasibility  constraints,  the  inequality  should  still  hold. 

Claim  8.  The  maximum  value  of  f(w°,w+ ,w~)  subject  to  the  following  constraints 

w°  +  w+  <  A  —  2, 
w°  +  w~  <  A  —  2, 
w°,  w+,w~  >  0, 


zs  f*  —  (A  —  2)  (A  —  1). 


Proof.  Note  that  f*  is  achieved  by  w°  =  w~  =  0  and  w+  =  A  —  2.  Thus  it  remains 
to  show  that  /  <  /*  for  all  feasible  (w° ,w+ ,w~).  For  fixed  w°  and  w+,  f  is  linear 
in  w~,  where  w~  G  [0,  A  —  2  —  ta0].  Thus  in  order  to  maximize  /,  we  only  need  to 
consider  w~  G  {0,  A  —  2  —  ta0}.  By  the  same  argument,  we  only  need  to  consider 
w+  G  {0,  A  —  2  —  ta0}.  Since  /  is  symmetric  in  w+  and  w~,  we  only  need  to  consider 
three  cases. 

(i) .  w+  =  w~  =  0.  In  this  case,  0  <  w°  <  A  —  2,  and 

f(w°,  0, 0)  =  ra°(A  -  <5  -  1  -  w°)  <  (A  -  2) (A  -  5  -1)  <  f*. 

(ii) .  w~  =  0  and  w+  =  A  —  2  —  w°.  In  this  case, 

f(w°,w+,  0)  =  w°(A  —  5—  1  —  ta°)  +  (A  —  2  —  ta°)(A  —  1)  =  fk  —  w0(w°  +  5)  <  /*. 

(iii) .  w+  =  w~  =  A  —  2  —  w°.  In  this  case, 

f(w°,  w+,w~)  =  w°{ A  -  5  -  1  -  W°)  +  2(A  -  2  -  w°)(w0  +  1) 

=  w°(3A  -5-7  -  3  w°)  +  2(A  -  2) 

<  w°(3A  -  8  -  3w° )  +  2(A  -  2). 

If  A  =  3,  then  w°  G  {0, 1}  and 

/O0,  w+,  w~)  <  w°(l  -  3 w°)  +  2  <  2  =  /*. 

If  A  >  4,  set  ta°  =  (3 A  —  8)/6  and 

f(w°,  w+,  w~)  <  ^(3A  -  8)2  +  2(A  -  2). 
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Thus 


r  -  /  >  (A  -  1)(A  -  2)  -  ^(3A  -  8)2  -  2(A  -  2)  =  ^A(A  -  4)  +  >  0. 

Therefore,  /  <  /*  for  all  feasible  (w°,w+,w~),  which  completes  the  proof.  □ 

Now  we  have 

(A  -  1)(A  -  2)  =  /*  >  S(n  -  2A  +  2), 

and  hence 

A 

which  violates  condition  (3).  Therefore,  G\  D  G2  =  0  as  desired. 

A. 2  Proof  of  Proposition  2.4.2 

Let  and  be  the  digraph  G  and  the  set  S  before  the  i-th  iteration  of  the 

while  loop  of  lines  2-10.  Given  a  vertex  v,  let  fl^  be  the  set  of  3-paths  in  G^ 
that  starts  at  v,  and  fL^  C  II  the  set  of  non-Type  IV  3-paths.  We  first  prove  the 
following:  If  fl^}  ^  0,  then  v  G  This  trivially  holds  for  i  —  1  since  =  V. 
Assume  it  holds  for  the  i-th  iteration.  Consider  the  (i  +  l)-st  iteration.  Let  wq  be 
such  that  n^1}  0.  Let  vq  the  node  picked  on  line  3  of  the  i-th  iteration.  Consider 

two  cases. 

(1) .  Suppose  the  condition  on  line  4  is  false.  In  this  case,  G(t+1>  =  G^  and  hence 

fiS  =  n^1}  7^  0.  Thus  Wo  7^  Vo,  and,  by  the  induction  hypothesis,  Wo  G  A-V 
By  line  8,  S^+1^  =  —  {u0},  so  w0  G  S^+1\ 

(2) .  Suppose  the  condition  on  line  4  is  true.  Let  n  =  (no,  V\,  V2,  V3)  be  the  3-path 

rewired  in  the  i-th  iteration.  Since  SA+b  —  gM  ij  ,  v2}  by  line  6,  by  the 
induction  hypothesis,  it  suffices  to  show  that  11^  7^  0  for  wq  ^  {vo,  Vi,  V2,  U3}. 
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Assume  wo  ^  {uo,  Vi,  v2,  u3}.  If  7 r  is  of  Type  III,  then  Ga C  Ga'1  and  hence 
0  7^  n^0+1)  C  n[,;o .  Now  suppose  n  is  of  Type  I  or  II.  Pick  a  3-path  7Ti  = 
(w0,  wi,  w2,  w3)  G  flwfj"1^-  Note  that  the  only  edge  in  Ga+1^  \  G$  is  (v0,v3). 
Since  (wq,w3)  ^  Ga+1^  and  wq  7^  Vo,  we  obtain  (wq,w3)  ^  Ga\  If  7Ti  G  Il^o, 
then  7Ti  G  fL^.  If  ^  ^  n^,  then  either  (■ Wi,w2 )  or  ( w2,w3 )  must  be  the 
newly  added  edge  (fo,u3).  Suppose  (wi,w2)  =  (ty, u3).  Then  7t2  =  (iuo,iui  = 
WW1W2)  G  n u)0.  If  (w0,v2)  £  Ga\  then  7t2  G  ni?,o.  If  (w0,v2)  G  Ga\  then 
7t3  =  (w0,u2,u3  =  w2,w3)  G  11$.  Since  (w0,ty3)  ^  G%\  vr3  G  fl$.  Thus 
flS  7^  0  if  (wi,w2)  =  (i’o,v3).  The  same  argument  shows  that  f[$  7^  0  if 
(w2,w3)  =  (vq,v3).  Therefore,  n$  ¥"  0  for  ah  cases. 

Therefore,  nj,?'1  7^  0,  then  v  G  S^f  When  Algorithm  1  terminates,  S  =  0,  so 
there  is  no  non-Type  IV  3-paths.  Now  it  remains  to  show  that  Algorithm  1  indeed 
terminates.  For  this  purpose,  let  X*  =  |S'^+1')|  —  Let  V:  =  1  if  if  the  i-th 

iteration  rewires  some  3-path  and  Yt  =  0  otherwise.  Note  that  if  V  =  1,  0  <  X \  <  2 
and  |Ga+1^|  <  |Cr$|  —  2;  otherwise,  X*  =  —1  and  |Ga+1^|  =  |Cr$|.  After  the  i-th 
iteration, 

i 

0<|G<i+1>|<|G«|-2j>j, 

3= 1 

and  hence  2  Yffj=\Yj  <  |  Ga  ^  |  <  \E\.  Thus 

i 

|£b+i)|  =  1^(1)  |  +  J^Xj 
3= 1 

i  i 

<  M  +  2  -£(i-*> 

i=i  i=i 

i 

=  |r|-i  +  3^y, 

i=i 

<|C|  +  5|£)|-i. 

It  follows  that  Algorithm  1  terminates  in  at  most  \V\  +  \\E\  iterations. 
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A. 3  Proof  of  Lemma  2.4.8 

We  break  the  proof  into  several  claims. 

Claim  1.  (E0  U  Ei)  D  G*  =  0. 

Proof.  For  k  <  £,  let  n[vk,  vf\  be  the  sub-path  of  n  from  to  V£.  Suppose  there  exists 
(■ Vi,Vj )  G  (E0  U  Ei)  D  G*.  Note  that  i  =  j  mod  2.  If  i  <  j,  then  tt[vq ,vf\  +  ( Vi,Vj )  + 
7 r[vj,  V‘2P]  is  a  path  of  odd  length  2p+l  +  i  —  j ,  which  requires  (vo,  v2p)  G  G*  by  Lemma 
2.4.3,  a  contradiction.  If  z  >  j,  then  tt [v3 ,  iy]  +  (iy,  vf)  is  a  cycle  in  G*.  By  Lemma  2.4.4, 
this  must  be  a  3-cycle  and  i  =  j- 1-2.  By  symmetry,  we  can  assume  j  >  1.  Lemma  2.4.5 
applied  to  Vj_ i  and  the  3-cycle  (vj,Vj+i,Vi,Vj)  then  requires  (i^_ i,Vj+i)  G  G*,  which 
we  have  just  shown  is  impossible.  Therefore,  (E0  U  Ef)  D  G*  =  0.  □ 

By  virtue  of  Claim  1,  a  pair  of  edges  ( Vi,Vj )  and  ( Vj,Vi )  of  E0  U  E\  are  either  both 
in  G *  or  both  outside  G*.  Thus  we  only  need  to  consider  (vt,  Vj)  G  E0  U  E\  for  i  <  j. 

Claim  2.  Either  (r’o,  V2P)  G  G *  or  (■ vi,V2P~i )  G  G*. 

Proof.  Suppose  the  contrary.  By  Claim  1,  (vq,  V2P)  ^  G*  and  (vi,V2P-i)  ^  G*.  Let 
H  =  G*  -  {(v0,vi),(v2p-i,v2p)}  +  {(vo,v2p),(v2p-i,vi)}.  Then  p(H )  =  p(G*)  and 
hence  H  is  also  a  maximum  digraph.  Now  7r[ui, u2p-i]  +  (v2p-i,Vi)  is  a  (2 p  —  l)-cycle 
in  Ha.  If  p  >  2,  this  contradicts  Lemma  2.4.4.  If  p  =  2,  this  contradicts  Lemma  2.4.5 
since  (u0,  i>i)  ^  but  (uoCbp-i)  G  Ha  by  applying  Lemma  2.4.3  to  7r[u0,  u2p-i]-  □ 

Claim  3.  If  (vo,v2p)  G  G*,  then  G*  D  E\  =  0  and  E0  C  G*. 

Proof.  Suppose  (u2*-i,  ^j-i)  G  G*,  where  j  >  i  >  1.  Note  that  (v2i-i,v2p)  G  G*  by 
Lemma  2.4.3.  Then  G  =  7 r[u0,  u2j-i]  +  (u2j-i  W2i-i ,v2p,  v0 )  satisfies  the  assumption  of 
Lemma  2.4.6.  Thus  there  exists  an  H  G  £/(d+,d-)  with  p(H)  =  p(G*)  +  2 (j  —  1)  > 
p(G),  a  contradiction.  Therefore,  (u2j_i,  n2j-i)  ^  G*  and  hence  fl  G*  =  0. 
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Vi  ->£>-  v2 


V5  -*>£-  Vi 


Vi  M -  v3 


(a)  p  =  3,i  =  1  ,j  =  2. 


(b)  p  =  2,i  =  0 ,j  =  1. 


Figure  A. 4:  Proof  of  Claim  3.  Reciprocity  can  be  increased  by  rewiring  the  edges 
marked  by  red  crosses  into  the  dashed  green  edges. 

Suppose  (v2 i,  v2j )  ^  G*,  where  i  <  j.  Since  ( Vo ,  v2p )  £  G*,  either  i  >  1  or  j  <  p—  1. 
By  symmetry,  we  may  assume  that  j  <  p  —  1.  Let 


H  =  G*  -  -  {{v2k,v2k+1)}pkJj/2  -  {(v2p,u0)} 

+  {{V2k-l,v2k-2)}k=i  +  {(v2k,V2k-l)}l=j/2+i  +  {(V2j,v2i)}', 


see  Figures  A. 4.  Then  H  £  Q(d+ ,  d~)  and  p(H )  =  p(G *)  +  2 i  +  2 (p  —  1  —  j)  >  p(G*). 
Thus  H  is  also  a  maximum  digraph,  i  =  0  and  j  =  p  —  1.  Now  (T0,Ui, . . .  ,v2j,v 0) 
is  a  cycle  of  length  2j  +  1  =  2p  —  1  in  Ha.  This  contradicts  Lemma  2.4.4  if  p  >  2. 
For  p  =  2,  by  applying  Lemma  2.4.5  to  v3  and  the  3-cycle  (vo,  vi,v2,  Vo),  we  obtain 
(i’i,v3)  £  Ha,  contradicting  EiCiG*  =  0;  see  Figure  A. 4(b).  Therefore,  (v2il  v2j)  £  G * 
and  hence  Eq  C  G*.  □ 

Claim  4.  If  (wi,  v2p-i)  £  G* ,  then  G*  D  E0  =  0  and  E\  C  G*. 

Proof.  First  consider  the  case  p  >  3.  Claim  3  applied  to  7r[t’i ,  u2p_i]  yields  E3  C  G* 
and  (v2i,v2j)  ^  G*  for  i  >  1  and  j  <  p  —  1.  It  remains  to  show  (v2i,v2j)  ^  G* 
for  i  =  0  or  j  =  p.  By  symmetry,  we  only  need  to  show  (vo,v2j)  ^  G*.  Suppose 
( v0 ,  v2j)  £  G*.  Consider  the  cycle  G  =  (n0,  v3)  +  7t[u3,  u2p-i]  +  (v2p_i,  ui,  n2j-,  v0 ),  which 
has  length  2p  and  satisfies  the  assumption  of  Lemma  2.4.6.  Note  that  G  fl  G*  = 
{(^2P-i,  Vi),  {v2j,  no)}.  Lemma  2.4.6  then  yields  an  H  £  C/(d+,d”)  with  p(H )  = 
p(G*)  +  2p  —  4  >  p(G*),  a  contradiction.  Thus  (n0,  v2j)  f  G*  and  L0  6  G*  =  0. 
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Figure  A. 5:  Proof  of  Claim  4.  Reciprocity  can  be  increased  by  rewiring  the  edges 
marked  by  red  crosses  into  the  dashed  green  edges. 


For  p  =  2,  Ei  C  G*  trivially.  By  Claim  3,  (n0,n4)  £  G*.  To  show  E0  n  G*  =  0,  by 
symmetry,  we  only  need  to  show  (v0,v2)  ^  G*.  Suppose  (v0,v2)  £  G*.  Let 

H  =  G*  -  {(u0,vi),  (v2,v0),  (v3,v4)}  +  {(v3,v0),  (v0,v4),  (u2,vi)}; 

see  Figure  A. 5.  Then  H  €  C?(d+,d_)  and  p(H )  =  p{G )  +  1,  a  contradiction.  Thus 
(i>o,  v2)  ^  G *  and  E0  D  G*  =  0.  □ 
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APPENDIX  B 


DATASETS  IN  SECTION  2.5 


Table  B.l:  Statistics  of  some  real  networks.  The  datasets  without  explicit  citations 
are  from  the  SNAP  repository  [52],  This  table  shows  for  each  network  the  number  of 
nodes  (column  2),  the  number  of  edges  (column  3),  the  number  of  reciprocal  edges 
(column  4),  the  number  of  reciprocal  edges  in  a  3-path  optimal  digraph  returned 


by  the  GreedyRewire  algorithm  on  page  20  (column  5),  and  the  upper 
Proposition  2.3.1  (column  6). 

bound  in 

Network 

Nodes 

Edges 

Reciprocal  Edges 

Observed  Algo.  1  Bound 

Biological  networks 

C.  Elegan  [73,  74] 

297 

2345 

394 

1364 

1467 

Mouse-Cortex  [80] 

49 

964 

656 

804 

825 

Protein  [70] 

6339 

34814 

4216 

22066 

23630 

Yeast  [68] 

6725 

201775 

1090 

6446 

8835 

A.  Thaliana  [76] 

10134 

15580 

12 

40 

77 

Communication  networks 

email-EuAll 

265214 

418956 

108950 

128192 

143287 

wiki- Talk 

2394385 

5021410 

723690 

1219196 

1285201 

Product  co-purchasing  networks 

amazon0302 

262111 

1234877 

670170 

801530 

858907 

amazon0312 

400727 

3200440 

1701142 

1945350 

2079813 

amazon0505 

410236 

3356824 

1834774 

2092700 

2227333 
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Table  B.l:  (continued) 


Network 

Nodes 

Edges 

Reciprocal  Edges 

Observed 

Algo.  1 

Bound 

amazon0601 

403394 

3387388 

1887960 

2130012 

2266214 

Social  networks 

Epinionsl 

75879 

508837 

206194 

299778 

317821 

Slashdot0811 

77360 

828161 

717962 

731044 

737201 

Slashdot0902 

82168 

870161 

731862 

749436 

758751 

Pokec 

1632803 

30622564 

16641200 

21997368 

22813049 

wiki- Vote 

7115 

103689 

5854 

31126 

35989 

LiveJournall 

4847571 

68475391 

51248308 

55619590 

56984610 

LiveJournal  [56] 

5204176 

76937805 

56456064 

61806458 

63451685 

Flickr  [56] 

1715255 

22613980 

14117878 

16401174 

16998181 

YouTube  [56] 

1138499 

4945382 

3909878 

3996410 

4086949 

Twitter  [51] 

41652230 

1468364884 

531703676 

690897836 

875520298 

ego-Twitter 

81306 

1768135 

851678 

1112236 

1179627 

Google+  [35] 

61858438 

948605109 

321728626 

414578876 

443168800 

ego-Google+ 

107614 

13673453 

2870336 

4954418 

5481158 

Stackoverflow  [77] 

1749197 

11894846 

26558 

2445802 

2965936 

Web  graphs 

BerkStan 

685230 

7600595 

1902250 

2257148 

2913141 

Google 

875713 

5105039 

1565976 

2106234 

2460500 

NotreDame 

325729 

1469679 

759142 

821340 

907239 

Stanford 

281903 

2312497 

639722 

770266 

983414 
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Table  B.l:  (continued) 


Network 

Nodes 

Edges 

Reciprocal  Edges 

Observed 

Algo.  1 

Bound 

Wikipedia  [1] 

English 

4709883 

328267748 

176523698 

215049808 

227103696 

Swedish 

1946669 

49061638 

10296750 

12792974 

13689733 

Dutch 

1794354 

50061183 

19993040 

23471168 

25078755 

German 

1738087 

69385800 

28079234 

38594032 

41799602 

French 

1555872 

87231786 

38347858 

49859546 

53102549 

Russian 

1163335 

68613850 

35807558 

42472180 

44437671 

Italian 

1160082 

85261756 

48584200 

55921822 

58593672 

Spanish 

1109589 

32489175 

4927794 

10429430 

11654906 

Polish 

1072883 

51993365 

28351902 

32917546 

34433059 

Japanese 

936882 

61591797 

26512542 

36239836 

38326442 

Portuguese 

841064 

39840808 

19062374 

23016802 

24224634 

Chinese 

781344 

49703600 

31848356 

36082340 

37248389 

Korean 

290291 

15595628 

9318976 

10859386 

11281173 

P2P  networks 

Gnutella04 

10876 

39994 

0 

13878 

16371 

Gnutella05 

8846 

31839 

0 

9584 

11830 

Gnutella06 

8717 

31525 

0 

9606 

11825 

Gnutella08 

6301 

20777 

0 

5604 

6947 

Gnutella09 

8114 

26013 

0 

7064 

8822 

Gnutella24 

26518 

65369 

0 

19142 

23920 

Gnutella25 

22687 

54705 

0 

15292 

19016 

Table  B.l:  (continued) 


Network 

Nodes 

Edges 

Reciprocal  Edges 

Observed 

Algo.  1 

Bound 

Gnutella30 

36682 

88328 

0 

25386 

31236 

Gnutella31 

62586 

147892 

0 

40564 

50227 

Call  Graph  [66] 

Dr  Java 

1702 

2920 

4 

778 

1056 

Endeavour 

724 

2067 

2 

358 

519 

FreeMind 

237 

623 

18 

140 

217 

JabRef 

868 

1532 

2 

340 

523 

j  Edit 

2222 

5172 

10 

1286 

1793 

JForum 

716 

1506 

2 

248 

364 

JPetStore 

222 

328 

0 

30 

42 

Kunagi 

781 

1345 

6 

348 

599 

logicaldoc 

892 

3682 

0 

194 

304 

Makagiga 

1777 

4075 

8 

1106 

1440 

OpenKM 

1390 

2525 

0 

384 

491 

openproj 

2824 

4866 

2 

1428 

1823 

OpenSyncro 

658 

1271 

2 

216 

327 

SweetHome3D 

1118 

2363 

12 

558 

844 

weka 

911 

1737 

2 

392 

581 

Linux  [75] 

12391 

33553 

316 

7982 

10933 
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APPENDIX  C 


ADDITIONAL  PROOFS  FOR  CHAPTER  3 


This  appendix  provides  proofs  for  the  main  results  in  Section  3.4.  We  will  use  the 
following  additional  notations  and  definitions. 

•  Denote  the  transition  probability  from  state  (xo,yo)  to  state  (x,y)  in  t  —  x  + 
y  —  Xq  —  yo  steps  by 

pr(x0,y0-,x,y)  =  P  cAVro)[(Xt,Yt)  =  (x,y)]. 

Note  that  the  transition  probability  is  nonzero  only  for  this  specific  t.  Thus  we 
will  often  omit  to  mention  t  explicitly  hereafter  and  assume  that  the  appropriate 
t  has  been  chosen. 

•  Let  rn  be  the  time  of  the  n-th  tie,  which  can  be  defined  recursively  by  To  =  —  oo 
and 

rn  =  inf >  rn_ i  :  Xt  =  Yt},  n  >  1. 

Note  that  T  =  tn- 

•  Denote  by  qr(x0,y0)  the  probability  of  having  no  tie  after  leaving  state  (x0}y0), 
i.e., 

qr(x0,y0)=F^f[Xt^Yt,t>l\. 

Note  that  qr(x0,x0 )  =  PcA,r0)[r2  =  °°]  and  qr(x0,y0)  =  Pca.^Vi  =  oo]  for 
x0  Vo- 
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•  Denote  by  An  t(x,y)  the  set  of  paths  that  start  from  (x,y)  at  time  0  and  end 
with  the  n-th  tie  at  time  t,  i.e.,  rn  =  t. 


C.l  Proof  of  Theorem  3.4.1 

Note  that  the  CA=  model  is  the  standard  Polya  urn  model.  The  proof  of  The¬ 
orem  3.4.1  combines  known  results  for  this  model.  Starting  from  the  initial  state 
(. xo,yo ),  Xt  has  a  beta-binomial  distribution  with  parameters  xq  and  yo  [45].  Note 
that  the  event  Xt  =  Yt  occurs  only  if  t  —  \x0  —  y0\  +  2k  for  some  integer  k  >  0.  For 
such  t,  Xt  =  Yt  if  and  only  if  Xt  =  Zo  +  k,  where  Zo  =  max{xo,  yo}.  By  Eq.  (6.27)  of 
[«], 


p&pi-v  =  y,]  =  p&rA  =  *, + k\  =  Q .  (c.i) 

Recall  that  qi(x,y)  is  the  probability  of  having  no  tie  after  leaving  state  (x,y).  Thus 

P&;f  \t  =  t}  =  p&;?°: '[xt  =  Yt]  ■  qi(z0  +  k,Zo  +  k ),  (c.2) 

where  the  second  factor  on  the  right-hand  side  is  the  probability  of  having  no  tie  after 
t. 

Recall  that  the  exit  probability  E(x,y)  in  [6]  is  the  probability  of  ever  having  a 
tie  starting  from  (x,y),  including  the  initial  state  (. x,y ).  Thus  for  x  ^  y,  qi(x,y)  is 
related  to  E(x,  y)  by 

qi(x,y)  =  1  -E(x,y). 

Using  Eq.  (22)  of  [6]  for  E(x,y),  we  obtain 

ftOr+M)  =  1  )=r(l  +  1)r(1/2)- 
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However,  qi(x,x)  ^  E(x,x)  =  1.  By  considering  the  one-step  transition  from  (x,x) 
to  (x  +  1,  x)  or  (x,  x  +  1),  we  obtain 


/  \  i  /  N  i  ,  ,  r(x  + 1/2) 

qi(x,x)  =  -qi(x  +  l,x)  +  -<h(x,x  +  1)  =  r(l+1)r(1/2) 


Eliminating  T(x  +  1/2)  by  the  identity 


T(2x)  =  ^~1/222x~1Y{x)Y{x  +  1/2) 


[60,  Eq.  (5.5.5)],  and  using  T(x  +  1)  =  xT(x)  and  T(l/2)  =  y7 tt ,  we  obtain 


Y(2x)  =  1 

^  X,X  x22x~1Y(x)Y(x)  x22x~1B(x,x) 


(C.3) 


Substitution  of  Eqs.  (C.l)  and  (C.3)  into  Eq.  (C.2)  yields 


l(XQ,yo)  \rp  _  n  _  _ 1 


PcAl  [T  =  = 


B(xq,  y0)  (zo  +  k)22k+2z°~1  \k 


For  t  —  |xq  —  l/o  |  +  2/c,  Stirling’s  formula  yields 


(!)  -  fx'x 


and  hence 


p&r)p'=«i  (cu) 

It  is  well-known  that  as  t  — >  oo,  Xt/ (Xt+Yt)  converges  almost  surely  to  a  beta  random 
variable  V.  It  follows  that  \Xt  —  Yt\/(Xt  +  Yt )  — >  \2V  —  1|.  Thus,  for  V  jL  1/2,  which 
holds  almost  surely,  we  have  \Xt  —  Yt\j (Xt  +  Yt)  >0  for  all  large  enough  t.  Therefore, 
Pca * [T  =  oo]  =  0.  Summing  over  t  in  Eq.  (C.4),  we  obtain  as  t  — »  oo, 


Pcau0)[T  >  t]  =  P>CA,l°)  [T  =  t 


{.XQ,yo)\rn  _  ./] 
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I 'V 


OO 


1 


3/2 


2  2-s  2x°+y°-5/2y/TcB(x0ly0)S 

ir _ i _ .-*»* 

2jt  2-  */*y/*B(x0,y0) 

_ 1 _ f — 1/2 

2xo+yo-5/2v/^jg('a;0)  yQ) 


where  we  have  used  the  fact  that  half  of  the  terms  are  zero  in  the  second  step,  and 
s  ads  in  the  third  step.  This  completes  the  proof  of  Theorem  3.4.1. 

C.2  Proof  of  Theorem  3.4.2 

Similar  to  Eq.  (C.2),  we  have 


FCA,r0)[T  =  t]  =  Pr^O,  Vo]  Z0  +  fc,  £0  +  &)  ’  <?r-(z0  +  k,  Z0  +  fc).  (C.5) 


Thus  the  proof  here  amounts  to  Ending  expressions  for  both  pr(xo,yo]  Zo  +  k,  Zo  +  k ) 
and  qr(z0  +  k,z0  +  h)  in  CA^.  We  break  the  proof  into  three  lemmas. 

Lemma  C.2.1. 


/  ,  ,  i  (x0)k(yo)h  fk  +  h\ 

Pr{x0,y0;x0  +  k,y0  +  h)  >  - - - - r -  ,  ,  (C.6) 

{rx0  +  yo)k+h\  k  ) 

for  all  k  >  0,h  >  0. 

Lemma  C.2. 2. 


pr(x0,y0-,x0  +  k,y0  +  h)  < 


(xo)k(yo)h 

(r-^kfrxo  +  y0)h 


(C.7) 


for  all  k  >  0,  h  >  0. 

Lemma  C.2. 3.  For  r  >  1, 

qr(x,  x)  —)■ 

as  x  — y  oo. 


(C.8) 
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Before  proving  these  lemmas,  we  first  use  them  to  prove  Theorem  3.4.2. 


Proof  of  Theorem  3-4.2.  By  Lemma  C.2.1,  we  have 


Pr(x0,y0;  z0  +  k,z0  +  k) 

^  (^0 )k+z0  -x0  ( yo)k+ZQ—yo  f2k  +  2zq  -XQ-  y0\ 

(rX o  +  yo)2k+2z0-x0-y0  \  k  +  Z0  —  X0  ) 

r(rx0  +  yf)  T(k  +  £o)T(&  +  zf)  T(2k  +  2  zq  —  Xq  —  y$  +  1) 

r(xo)r(|/o)  T(k  +  Zq  —  Xq  +  l)V(k  +  £o  —  Vo  +  1)  T(2k  +  2zq  +  (r  —  l)xo) 

Using  the  relation  F(k  +  a)/V{k  +  b)  ~  ka~b  as  k  — »  oo,  we  obtain 


Pr(a0,2/o;2o  +  k,  z0  +  k)  > 


T(ra:o  +  y0)  (r-pao-i 

2x°+y°-2r(x0)r(y0)  J 


=  2(r  +  l)a;o^i(2A;)-(r-1^-1, 


where  tpi  is  given  by  Eq.  (3.4).  Application  of  this  asymptotic  bound  and  Lemma 
C.2.3  to  Eq.  (C.5)  yields 


p {cAyr0)[T  =  t]>  2 (r  -  ljxoVit-^-V*0-1.  (C.9) 


Note  that  Pqa ’^°\t  =  +oo]  =  Pca’^-ZV  =  +°°]  =  0’  where  the  second  equality  will 
follow  from  Theorem  3.4.4,  so  we  will  not  provide  a  separate  proof  here.  Summing 
over  t  in  Eq.  (C.9)  and  noting  that  half  of  the  terms  are  zero,  we  obtain  as  t  — >  oo, 


P&fU  >  t]  =  = A  > 

t'=t 


1  )x0(f1s-^x°-1ds 


establishing  the  lower  bound. 
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In  a  similar  way,  Lemma  C.2.2  yields 


Pr(x0,y0-,z0  +  k,z0  +  k)  <2(r  +  l)(x0-r  1)<p2(2k)  (r  1)(a;o  r  1}  \ 


where  ip2  is  given  by  Eq.  (3.5).  Application  of  this  asymptotic  bound  and  Lemma 
C.2.3  to  Eq.  (C.5)  yields 

=  t]<  2(r  -  l)(i„  - 


and 

OO 

pgo,J/°)[T  >  =  J2^CAVr0)iT  =  *']  £  ^t~' (r-Vto-r-1) 

t'=t 


establishing  the  upper  bound. 


□ 


Now  we  prove  the  lemmas.  Recall  that  the  transition  probability  p(x0,y0]  x,y)  of 
going  from  (xo,yo)  to  (x,y)  satisfies  the  following  recursion  (Chapman-Kolmogorov 
equation), 


pr(x0,y0;x,y )  = 


r(x  —  1) 
r(x  —  1)  +  y 


pr(x0,y0-,x-  1  ,y)  + 


y- 1 


rx  +  y  —  1 


pr(x0,yo]x,y  -  1), 


(C.10) 

for  x  >  xq,  y  >  yo  and  x+y  >  £o+Z/o+l,  with  the  boundary  condition  pr(x o,  yo ;  x,  y)  = 
0  for  x  <  xo  or  y  <  y0.  Note  that  we  have  replaced  the  one-step  transition  probabilities 
QcA,r(x  —  1  ,y;x,y)  and  QcA,r(x,y  —  1  ;x,y)  by  the  expressions  in  Eq.  (3.1). 


Proof  of  Lemma  C.2.1.  We  will  use  the  short-hand  notation  p(k,  h )  for  pr(x 0,  yo ;  xQ  + 
k,  yo+h),  and  ^(k,  h )  for  the  right-hand  side  of  Eq.  (C.6).  We  first  prove  the  boundary 
case  for  k  —  0.  By  Eq.  (C.10),  for  h  >  1, 


P(  0,  h) 


yo  +  h- l 

rx  o  +  yQ  +  h  - 


-p{f),h  -  1), 
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which  is  a  simple  recursion  in  h  and  can  be  expanded  to  yield 


P(0,h) 


(lJo)h 

(rx  o  +  y0)h 


p(  0,0) 


(yo)h 

(rx  o  +  y0)h 


^(0  ,h), 


which  yields  Eq.  (C.6)  for  k  =  0  and  h  >  1.  Here  we  have  used  p( 0,  0)  =  pr(x o,  yo ;  xy,  yo) 

1. 


Similarly,  for  the  other  boundary  case  h  —  0,  k  >  1,  we  have 


p(M) 


(^o)fc 

(x0  +  r~1y0)k 


p(  o,o) 


(*£o)fc  ^  (*£())fc 

(x0  +  r~1y0)k  ~  (rx „  +  y0)fc 


V>(M), 


where  the  last  inequality  is  because  (x)k  increases  with  x,  and  xq  +  r_lyo  <  rxo  +  yo- 
For  the  general  case,  we  use  induction  on  k+h.  The  base  case  k  +  h  =  1  is  already 
proven,  since  either  k  =  0  or  h  =  0  when  k  +  h  —  1.  Assume  Eq.  (C.6)  holds  for 
k  +  h  —  m  >  1.  Consider  /c  +  h  —  m  +  1.  We  can  also  assume  k  >  1  and  h  >  1,  since 
we  have  proven  the  boundary  cases  for  k  =  0  or  /?.  =  0.  The  recursion  in  Eq.  (C.10) 
yields 


p(k,h)  = 


> 


r(x0  +  k  —  1) 
rk  +  h  +  c0  —  r 
r(x  o  +  k  —  1) 
rk  +  h  +  c0  —  1 


y(/c  -  1,  h)  + 
p(k  -l,h)  + 


yo  +  h  —  l 
rk  +  ft,  +  c0  — 
l/o  +  h  —  1 
rk  +  h  +  c0  — 


yp(/c,  h  -  1) 


where  Co  =  rx o  +  yo- 

Applying  the  induction  hypothesis  p(k  —  l,h)  >  ijj( k  —  1  ,h)  and  p(k,h  —  1)  > 
^(k,  h  —  1)  to  the  above  inequality  yields 


n  i  ^  ^  r  +  k  -  i  , ,,  ,  , ,  y o  +  n- 

P{k ’  fc)  £  rt  +  ;i  +  Co-l^  -  !'  +  rt  +  /,  +  e„-l 

(rfc  +  /i)(fc  +  /i  +  c0-l)  .  . 

=  (fc  +  fe)(rt  +  fe+Co-i)^-A)- 
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where  in  the  last  step  we  have  used 


if(k  —  1,  h) 


k 

k  +  h 


k  +  h  +  c0  — 
xq  +  k  —  1 


1 


i>(k,  h), 


and 


f>(k,  h  —  1) 


h 

k  +  h 


k  +  h  +  Co  — 
Vo  +  h  -  1 


1 


ip(k,h). 


To  complete  the  proof,  it  suffices  to  show  that 


(■ rk  +  h)(k  +  h  +  c0-l)  > 

(k  +  h)(rk  +  h  +  c0  —  1)  ~  ’ 


but  this  is  equivalent  to  r  >  1,  which  is  true  by  assumption.  □ 

Proof  of  Lemma  C.2.2.  The  proof  of  Lemma  C.2.2  follows  the  same  line  of  reasoning 
as  that  used  to  prove  Lemma  C.2.1.  The  boundary  cases  can  be  verified  directly.  We 
only  outline  the  induction  step  here.  Applying  Eq.  (C.7)  to  the  right-hand  side  of 
Eq.  (C.10)  yields 


!  x  <  r(x o  +  k  -  1)  .  (x0)k-i(yo)h  +  y0  +  h  -  1  _  (x0)k(y0)h-i 

~  rk  +  h  +  c0  -  r  (r-^k-i^h  rk  +  h  +  c0  -  1  (r-1)*^)/*-! 
'  rjr-1  +  k  -  1)  cp  +  h-1  (x0)k(y0)h 

rk  +  h  +  c0  -  r  rk  +  h  +  c0  -  1 J  (r-1)fc(c0)/i ' 


Note  that 

r(r-1  +  k  —  1)  c0  +  h  —  1  r(r_1  +  k  —  1)  c0  +  h  —  1 

rk  +  h  +  Co  —  r  rk  +  h  +  cq  —  1  ~  rk  +  h  +  cq  —  r  rk  +  h  +  cq  —  r 


which  completes  the  induction.  □ 

Proof  of  Lemma  C.2.3.  Recall  that  An:2k(x,  x)  is  the  set  of  paths  that  start  from  (x,  x) 
at  time  0  and  end  with  the  n-th  tie  at  time  2k,  i.e.,  rn  =  2k.  Let  An^k  =  An  2fc(0,  0). 
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Note  that  the  paths  in  An^k{x,x)  are  exactly  the  paths  in  An^k  translated  by  (x,x). 
Let  7 f  G  An>2k  and  its  state  at  time  t  be  nt  =  (5y,2/t)-  The  translation  of  7 f  by  (x,x), 
denoted  x  +  n,  is  a  path  in  AHt2k(x,x),  whose  probability  in  the  CA  model  is  given 
by 


2k— 1 


pO,z) 


{x,x)  1  ,  ~-i  'i  f 

CA,rF  +  7r]  =11 


3=0 


r(x  +  x 


r(x  +  xj)  +  (x  +  yj) 


Xj  + 1 


(x  +  2/j) 


\  Vj+l-Vj 


r(x  +  +  (x  +  y.j) 


) 


For  fixed  k  and  7 f,  as  x  — *  00,  PjAf],  [x  +  7r]  converges  to 


®J+1 


r  +  1 


Vj+i-Vj 


p(0,0) 

rRW,r 


7T 


which  corresponds  to  the  probability  of  the  path  n  in  a  random  walk  with  parameter 
r/(r  +  1).  Thus,  as  x  — *  00, 


p(®>*) 

rCA,r 


Tn 


2k] 


Pc£[^  +  7r]  ->■  PRw!r[7r]  =  PRw!rtrn  =  2k]. 


After  summing  over  h  and  using  the  Dominated  Convergence  Theorem,  we  obtain 


oo 


pCp) 

rCA,r 


Tn  <  OO]  =  ^  P 


(x,x) 
CA  ,r 


rn 


k= 1 


2fc] 


OO 

-» V  p(0’0) 

^  RW,r 

fe=l 


Tn 


2fc]  =  P 


(0,0) 

RW,r 


Tn  <  OO  . 


In  particular, 


qr(x,x) 

where  we  have  used 


=  1  -  pca?[t2  <  oo]  ->  1  -  Prw ,l[r2  <  oo]  = 

Eq.  (3.3)  in  [38]  for  P[[^r [t2  <  oo]  in  the  last  step. 


□ 
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C.3  Proof  of  Theorem  3.4.3 


Recall  that  Anjt(xo,yo)  is  the  set  of  paths  starting  from  (xo,yo)  that  end  with 
the  n-th  tie  at  time  t,  i.e.,  Tn  =  t.  We  will  use  the  short-hand  notation  Anj  for 
An,t{x0,y0).  As  in  Section  C.l,  the  set  Anj  is  non-empty  only  if  t  =  \xQ  —  yo\  +  2k  for 
some  integer  k  >  n  —  1,  in  which  case,  every  path  in  Antt  ends  in  state  (zo  +  k,zo  +  k ) 
with  z0  =  niax{x0,  r/o}.  Recall  from  [6]  that  the  probability  of  any  path  n  connecting 
states  ( x0,y0 )  and  ( x,y )  is 


p(Ao,yo) 

rCA,l 


n 


B(x,y) 
B(x0,y0 ) 


B(x,  y)  Ofp(zo,t/o) 
B(x0,y0)  RW’X 


7 r  . 


Summing  over  n  G  An,t,  where  x  =  y  =  Zq  +  k,  we  obtain 


pfAO.J/o) 

rCA,l 


Tn 


t] 


B(z0  +  /c,  Zq  +  k ) 

B(x0,y0) 


oipCojj/o) 
z  rRW.l 


Tn  =  t]. 


(C.ll) 


Thus  the  probability  of  having  the  n-th  and  also  the  last  tie  at  time  t  is  given  by 


Pca,iA,)  [T  =  t,N  =  n]  =  Pca’iu)  [rn  =  t]  ■  qi (z0  +  k,z0  +  k) 

_  _ 1 _  .  1  p(so,i/o)r 

2*0+1/0-25(2^  yo)  t  +  Xo  +  yo  BW,l  » 


4, 


(C.12) 


where  we  have  used  Eqs.  (C.ll)  and  (C.3)  in  the  last  step.  Note  that  lTn  =  A  is 

the  probability  fn,t{do )  of  the  n-th  visit  to  the  origin  at  time  t  in  a  simple  symmetric 
random  walk  starting  from  do  =  |xq  —  yo\-  Summing  over  t  in  (C.12),  we  obtain 


,|Af  =  nl  =  ,.yo)  2 

-Gn(l-d0),  (C.13) 


2x°+v°-2B(xo,y0 ) 
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where 


OO 


Gn(z-,do)=  J2 


kz^_1  2/c  +  do  +  Xq  +  yo 
To  simplify  Gn(z;  d0),  we  have 


fn,do+2k(do)zd°+2k. 


^-[z*o+yoGn(z.do)] 


=  zxo+yo—i 


fn,do+2k(d0)zd0+2k  =  Z^-^Z-do), 


(C.14) 


k=n—  1 

where  $n(z;  do)  =  /n,d0+2fc(do)2d°+2fc  is  the  generating  function  of  the  proba¬ 

bility  distribution  of  the  n-th  visit  to  the  origin  in  a  simple  random  walk  starting  from 
do-  Let  Fi(z)  be  the  generating  function  of  the  distribution  of  the  time  of  the  first 
return  to  the  origin  in  a  simple  random  walk  starting  from  the  origin.  The  standard 
renewal  argument  (see  e.g.  XI. 3. d  of  [28])  shows  that  &n(z;d0)  is  given  by 


^n(z;d0)  =  [^1(z;l)]d°[F1(z)]n-1, 


where  $i(z;  1)  and  Fi(z)  are  given  by  Eqs.  (3.6)  and  (3.14)  of  [28,  Chap.  XI],  respec¬ 
tively.  Therefore, 

<M+  do)  =  Z~d0  (l  -  VT^J  •  (C.15) 

Substituting  Eq.  (C.15)  into  Eq.  (C.14)  and  integrating  from  0  to  1  yields 

/I  /  _ \  n+do  —  1 

z'2  mm{x0,J/o}—  1  ^ 

where  we  have  used  x0  +  yo  —  do  =  2  min{x0,  y0 }  •  A  change  of  variable  u  =  \/l  —  z2 
yields 

Gn(  1;  do)  =  [  u(  1  -  w2)min{xo,W,}-l(1  _  u)n+d0- 

Jo 

which  is  upper  bounded  by 


Gn(  1;  do)  < 


uy+d0-idu  =  5(2,  n  +  d0), 


(C.16) 
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and  lower  bounded  by 


Gn(  1;  do)  > 


w)min{*olW,}-l(1  _  uy^+do-ldu 


B  (2,  n  +  max{x0,  yo}  ~  1) , 


(C.17) 


where  we  have  used  min{a;o,  yo}  +  d0  =  max{x0,r/0}-  Applying  Eqs.  (C.16)  and  (C.17) 


to  Eq.  (C.13)  yields 


B(2,n  +  max{x0,y0}  -  1)  p(*0,w)r/v-  =  i  <  B(2,n  +  dp) 

2x°+yo-2B(x0,y0)  ~  CA>1  “  2x°+y°-2B(x0,y0)' 


Note  that  P^’^o)[lV  =  oo]  =  Pcai^IT  =  oo]  =  0.  Summing  over  n  and  using 
EL  b(2,  m)  =  rT1,  we  obtain 


_ <  p(xo,j/o)ny  >  m  < _ 

2x°+yo~2B(x0,  y0)  n  +  max{i0,  yo}  —  1  —  CA,:L  ^  —  —  2xo+yo-2B(x0,yo)  n  +  d0’ 


which  immediately  yields  Eq.  (3.6). 


C.4  Proof  of  Theorem  3.4.4 

We  first  prove  the  following  lemma. 


Lemma  C.4.1. 


The  probability  Pca^Iji 


<  oo]  of  ever  having  a  tie  is  bounded  as 


follows, 


■pl^OiJ/o) 

rCA,r 


[ti  <  oo]  < 


J1’ 

I  (VO  )xQ  —  yQ  i 

^  (rxo+yo)x0-y0  v  r) 


x0  <  yo, 


xo  >  yo- 


Note  that  P[h(''!Aj)  [ti  <  oo]  is  the  exit  probability  E(x,y )  in  [6]  when  r  =  1. 


Proof.  The  case  x0  =  y0  is  trivial  since  T\  =  0.  When  x0  <  yo,  Theorem  3.21  of  [42] 
yields  Yt/Xt  —>  0  almost  surely,  from  which  it  follows  that  Xt  >  Yt  eventually  and 
hence  P^a] 'yr°\Ti  <  oo]  =  1. 


Ill 


Now  assume  x0  >  y0.  Recall  that  Ait(x0,y0)  is  the  set  of  paths  starting  from 
(xo,yo)  that  end  with  the  hrst  tie  at  time  t.  Note  that  Ai:t(xo,yo)  is  nonempty  only 
if  t  =  d0  +  2k,  where  do  =  xq  —  yo  and  k  >  0.  Let  n  G  A^t{xo,  yo)  and  its  state  at 
time  j  be  7 Tj  =  ( x3,y3 ).  The  probability  of  the  path  n  is  given  by 


mteojj/o) 
rCA  ,r 


TXj 

rxj  +  yj 


Xj+ 1  X  j 


Vj 


rxj  +  yj 


rXt~Xo(x0)Xt_xo(y0)yt_yo 

El %l(rxj  +  Vj) 

rxt-xo(x0)Xt_Xo(y0)yt_yo 
n'=10[(r  -  1  )xj  +  Xo  +  yo+  j] ' 


yj+i-yj 


where  in  the  last  step  we  have  used  Xj  +  y3  =  x0  +  y0  +  j .  Note  that  [tt]  is 

maximized  if  the  Xj' s  are  minimized,  subject  to  the  constraints  that  the  x/ s  increase 
monotonically  from  xq  to  xt  with  step  size  0  or  1,  and  that  Xj  >  yj  for  all  1  <  j  <t  —  1, 
or  equivalently  x3  >  x0  +  (j  —  d0)/2.  This  is  achieved  by  the  following  sequence, 


x0, 


x:  =  < 


+  L U  ~  do)/2\  +  1, 


j  0, 1, ... ,  do  1, 
j  =  d0,  d0  +  1, . . .  ,t  -  1; 


xt,  j  =  t. 


The  corresponding  path  n*  has  probability 


do  2  .  xt  1  -j  -t 

p(zo,3/o)r*l  _  TT  Vo+J  TT  rx _ x  -  1 _ Xt  -  1 

CA,r  11  rx0  +  yo  +  j  4^"  rx  +  (x  —  1)  r(x  +  1)  +  (x  —  1)  rxt  +  (xt  —  1)  ’ 

J- — 0  X — xo 


which,  after  arrangement,  yields, 


do  — l  ,  ■  xt— i 

p(^o,yo)r*i  =  TT  Vo+J  TT  rx _ x _ 

cA,r  11  rx0  +  yo  +  j  4^"  (r  +  ^)x  +  r  (r  +  +  (r  —  1) 

<  (yo)d0 _ rXt~X0 

-  (rx 0  +  7/0 ) do  (r  +  1 )2(xt~xo) 
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r 


Xt—XQ 


1 


yt-yo 


{yo)do{r  +  l)d° 
(rx  o  +  j/o )  4, 


r  +  1 


_  Mrf0(r  +  l)d°  ]p(®0>2/o) 

(rso  +  i/o)*,  RW’r 


7T 


r  +  1 


Thus  we  have 


mOro.j/o) 

rCA,r 


7r]  <  P 


(®0>2/o) 
CA  ,r 


tt*]  < 


(yo)d0(r  +  l)d° 

pOu),yo) 

(rx  o  +  2/oU  RW’r 


7T 


and,  after  summing  over  7 r  G  -4i,t(xo,  2/o), 


ttd(^0,!/o) 

rCA,r 


[T1  =  *]  < 


(yo)d0(r+l)d° 

p(®o,j/o) 

(rx  o  +  jfo)*  RW’r 


[ri  =  t]. 


Summing  over  t,  we  obtain 


ipi^o.yo) 

rCA,r 


[ri  <  Oo]  < 


(yo)d0(r+  l)d° 

p(*o,J/o) 

(rx  o  +  j*,)*  RW’r 


[ri  <  oo  . 


By  Eq.  (3.9)  and  XI. 3. d  of  [28],  [r>  <  00 ]  —  r  d° ,  from  which  the  desired 

conclusion  follows.  □ 


Corollary  C.4.2.  The  probability  of  having  at  least  one  more  tie  starting  from  a  tie 
state  (x,  x )  is  bounded  by 

PcTrb2  <  oo]  < 

Proof.  By  considering  the  one-step  transition  from  (x,  x)  into  (x,  x  +  1)  or  (x  +  1,  x), 
we  obtain 


rCA,r 


[r2  <  oo 


r 

r  +  1 


p(x+l,x) 

rCA,r 


[ri  <  oo]  + 


1 

r  +  1 


p(x,x+l) 

rCA,r 


[ri  <  OO 


< 


X 

(r  +  l)x  +  r 


1  2 

- <  - . 

r  +  1  r  +  1 


where  the  first  inequality  follows  from  Lemma  C.4.1. 


□ 
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Now  we  prove  Theorem  3.4.4. 


Proof  of  Theorem  3.4-4-  Let  Zn  be  the  common  value  of  Xt  and  Yt  at  t  =  rn,  i.e., 
Zn  =  XTn.  Conditioned  on  rn  <  oo  and  Zn  =  z,  the  probability  of  rn+i  <  oo  is  just 
the  probability  of  having  a  tie  after  leaving  (z,  z ).  Thus 


FCA,r0)[Tn+l  <  OO  |  Tn  <  OO,  Z,, 


Z 


m(zX 

rCA,r 


[r2  <  oo]  < 


2 

r  +  1  ’ 


by  Corollary  C.4.2.  Removal  of  the  conditioning  yields 


Tn>{xo,yo) 
rCA  ,r 


[rn+ i  <  oo 


Tn  <  OO]  < 


2 

r  +  1 


It  follows  that 


P<3;»ol[/V  >  n]  =  P<3;“>[r„  <  oo] 

n—1 

=  PCA,r0)[rl  <  OO]  nPCA,r°)[r*+1  <  OO  |  Tj  <  CX)] 
i= 1 


An  application  of  Lemma  C.4.1  completes  the  proof. 


□ 
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APPENDIX  D 


ADDITIONAL  PROOFS  FOR  CHAPTER  4 


This  appendix  provides  proofs  of  the  theorems  in  Section  4.3.1. 


D.l  Proof  of  Theorem  4.3.1 

Let  pi  =  A-1  Yj=i  Xj.  Note  that  A_1Aj+i)j  =  1  —  pi  and  A_1Aj_i)j  =  Pi-i.  Also 
note  that  niji+ 1  =  i  and  n%^_\  —  n  —  i  +  1.  Thus  the  optimal  social  cost  in  (4.8)  can 
be  rewritten  as 


c*  =  r  Y  r  \Z*(f  Pi)  +  V(n  - i  +  l)pi-i 

Tl  L  _  Oj  - 
1=1 


> 


1 


(n  -  1  )K 


n 

Y  [\A(1  Pi)  +  V(n  -  i  +  l)Pi-l 

i=  1 


The  inequality  (x  +  y)2  >  x2  +  y2  for  xy  >  0  then  yields 


c*  > 


c° 

h  ’ 


(D.l) 


where 

1  n 

C°  =  -  Y  “  Pi)  +  (n~i  +  l)p*-i]  •  (D-2) 

i= 1 

Let  i*  =  min{i  :  p,  >  1/2}.  Since  Pi  is  increasing  in  i,  we  have  Pi  <  1/2  for  all  i  <  i* 
and  pi  >  1/2  for  all  i  >i*.  Therefore, 


C°  >  - 

n  —  1 


V  — 1  1  n  .. 

Y^i  +  Y  2(n~*  +  1) 

i  =  I  i.—r *-l-1 
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1 

2  (n  -  1) 


n  +  1  \  2  n2  —  1 

~J  +  ^“ 


> 


n  +  1 
8 


(D.3) 


which,  combined  with  (D.l),  yields  the  lower  bound  in  (4.9). 

The  cost  under  selfish  allocation  in  (4.6)  can  be  upper  bounded  as  follows, 


C  = 

< 


n  —  1  ^  b 

i= 1 


i- 1 


(n  —  l)6i 


—  ^  V1  “Pi  +  (n  -  i  +  1)V^IT 

:lin  *=i 


-  Pi  +  v'pm) 

-  Pi  +  \/P^i) 


1 

(— yj-E  [^  -?,)  +  («  -i  +  ife-l 
c°  +  cd 

~b~-  ’ 

^rmn 


+  (it  +  1 )  V7 Pi- 1  ( 1  -  Pi) 


(D.4) 


where  C°  is  as  in  (D.2)  and 


n—  1 


i= 2 


Since  A/pi-i(l  -  pi)  <  2  +  1  -  p*)  <  2  \ 


d  /  (n  +  l)(n-2) 


C"4  < 


2(n  -  1) 


(D.5) 


(D.6) 


For  an  upper  bound  on  C°,  we  again  use  pi  <  1/2  for  i  <  i*  and  p,  >  1/2  for  i  >  i*. 

Thus 


C°  =  j-  [Pi-iC71  -  i  +  1)  +  *(1  ^  Pi)] 
2=1 


1 

<  - 

n  —  1 


1 

2(n  -  1) 


5  n2  +  6n  +  1 
4 


n  +  1 
2 


2 
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D.2  Proof  of  Theorem  4.3.2 

Consider  the  case  of  homogeneous  content  creation  rates  with  A*  =  1.  Consider 
a  heterogeneous  set  of  budgets  of  allocation  where  bi  =  b\  >  b2,  for  all  i  ^  2.  The 
optimal  social  cost,  under  centralized  allocation,  is  given  by 


where  C°  is  as  in  (D.2).  The  social  cost  under  selfish  allocation  is 


where  Cd  is  as  in  (D.5).  By  (D.3),  (D.7)  and  (D.6),  C°  =  0(n)  and  Cd  =  0(n).  If 
b2/b1  =  D(n2),  then  C*  =  and  C  =  Q(b^ln~1/2).  Thus  PoS  =  C/C*  = 

0 ( \/n) ,  which  grows  unbounded  with  the  network  size. 


D.3  Proof  of  Theorem  4.3.3 

Since  b  changes  C*  and  C  only  by  a  multiplicative  factor,  it  suffices  to  prove  the 
theorem  for  b  =  1.  Let  i  be  the  hub  of  the  i-th  star,  and  Li  the  leaf  nodes  connected 
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to  it.  Then  riu  —  n  —  1  and  nu  =  1  for  any  l  G  Lj.  For  adjacent  hubs,  rq_ i,*  =  (i  —  l)p 
and  =  {k  —  i)p.  Then  the  optimal  social  cost  is  given  by 


c’-;^ry(£Dn“1)  +  Sl)-1“^  + 

v  '  \  i=l  leLi 


n  n(n  —  1)  ’ 


where 


*  =  E 


i=  1 


Y  Vn  -  1  +  Y  y(n~nji)n. 

j=i±  1 


,«ei 


Using  the  inequalities  a2  <  (EE<d  <  3Si=1ai)  we  obtain  <Si  = 
where 


/C  2 


5*2  =  k(p  —  l)2(n  —  1)  +  EE  P2j(k  —  j) 

2—1  j=i — 1 

=  (n  —  k)(p  —  1  )(n  —  1)  +  -n(nk  —  p)  —  @(n2  maxjp,  k}). 

o 


Thus  U*  =  0(max{p,  k})  =  D,(y/n). 

The  cost  under  selfish  allocation  can  be  written  as  follows, 


C=Wi-n(EE(«-1)  +  S4=1--+  & 

n[n  —  1)  1  11 


l£Li 


n  n(n  —  1)  ’ 


where 


s3  =  e 


Using  the  facts  that  |L*|  =  p  —  1,  <  n  and  the  inequality  (n  —  x)\fx  <  ;^n3/2  for 

x  G  [0, n],  we  obtain 


E1+  E 


m 


\l(zLi 


j=i±l 


E' 

,l£Li 


n 


i) 


E' 

j=i±i 


n 


k 

s3  <  Y(p+ 2Vw) 

2=1 


— n3/2^l 
3v/3  J 
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=  O  ((n  +  k\/n)  (pn  +  n3^2))  =  O  (n2  ma x{k,p})  . 

Thus  C  =  0(max{p,  k}).  Since  C  >  C*  =  0(max{p,  k}),  we  have  C  =  0(max{p,  k}) 
and  PoS  =  0(1). 


D.4  Proof  of  Theorem  4.3.4 

As  in  the  proof  of  Theorem  4.3.3,  it  suffices  to  consider  the  case  6  =  1.  Note  that 
A ji  =  riji .  The  optimal  social  cost  (4. 8)  and  cost  under  selfish  allocation  (4.6)  can  now 
be  written  as  follows, 


=  w(w  _  i)  E 1  E  v  )  ’ 


iEV  \ j£N(i) 


and 


C  = 


n(n  —  1) 


i&V 


E (n  ~ 

jeN(i) 


Tlji )  y  Tlji 


Let  the  depth  of  the  tree  be  h.  Then  n  =  E-E  •  Label  the  nodes  in  such  a  way 
that  that  for  i  =  1,  2, . . . ,  n,  p(i)  =  |"(i  —  1)/Af|  is  the  parent  of  i,  Cj(i )  =  k(i  —  1)  +  j  + 1 
is  the  j'-th  child  of  i,  for  j  =  1,2, ...  ,k.  Then  np^:i  =  AAALLT  an(]  nCj^:i  = 
kh  fc-i"1,  where  h(i)  =  [log k(ki  —  f)J  is  the  depth  of  node  i. 

By  (D.8)  ,  the  optimal  social  cost  is 


El  \A lji 


(D.8) 


(D.9) 


ueN(i) 


C*  = 


_ - _ y 

n(n  —  1)  ' 

v  '  1=1 


K, 

(jl  Ylp(i) ,i  T  ^  1  (ri  TlCj (i) , i ) Tlcj (j) ^ 


3= 1 


which  is  bounded  by 


— - -t(<S'i  +  k2S2 )  <  C*  <  — - -t(<Si  +  k2S2), 

n{n  —  1)  n(n  —  1) 

where 

n 

i= 1 
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and 


*5*2  ^  ^  ^  ^ci  (i),i)  ^ci  (*),*• 


Using  the  facts  that  n 
obtain 


i=  1 

fc_i  )  'wU 


h + 1  _  h  —  h  (  i  )  + 1 
k-V 


cind.  Tici(i),i 


fch  —  h(i) 

API  ’ 


we 


and 


s1  =  — — Y \kh~h(i)+i  _  i)(kh+1  -  kh~h^+1) 

'  '  i= 1 


—  kh\kh~h'+1  -  1  )(kh+1  - 

’  h'= 0 


{k 


_  hk  -  h  -  2  2fe+2  hk-h  +  2k  j  h+l 
(k-  l)3  (A;-l)3 

=  Q(hk2h )  =  @(n2logfcn), 


S2  = 


*  J^(A;h+1  -  _  i) 

'  '  i=l 


{k 


V  ApA;ft+1  -  kh~h)(kh~h'  -  1)  =  —S\. 

'  h'= 0 


Therefore, 


1  +  i  5,  <  C"  <  U  +  *Ui, 


n(n  —  1)  n(n  —  1) 

and  hence  C*  =  ©(ATog^n). 

By  (D.9),  the  social  cost  under  selfish  allocation  is 


C 


n(n 


,  \  ^  ^  [(^  ^p(i) ,i)  \J  ,i  T  k{n  ^ci(i),i)  \/^'i,ci(i)] 

'  i=l 


n(n  —  1) 


(5i  +  k2S2  +  A;  5*3), 
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where 


n 


i=l 


Note  that  2 n  >  2 n  —  nci^ti  —  np^}i  >  n,  and  n  >  np^ti  >  nj 2  for  i  ^  1,  so 


We  also  have 


S3 


0 


'Tl ci(i),i 


rk^\ 


y]  kh  V kh  h'  —  1  =  Q(n/k), 


ycilding  63  =  0 ( kn2  logfc  n  +  n5//2)  and  hence  C  =  Q^Ss/n2)  =  0(/clogfc  n  +  \/n). 
Taking  the  ratio  C/C*,  the  result  follows. 
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